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Gene Then^y vectors, 
which are especially useful for 
cystic fibrosis, and methods for 
using the vectors are disclosed. 
In preferred embodiments, the 
vectors are adenovirus-based. 
Advantages of adenovirus-based 
vectors for gene therapy are that 
they appear to be relatively safe 
and can be manipulated to encode 
the desired gene product and 
at the same tune are inactivated 
in terns of their, ability to 
replicate in a nomial lytic 
viral life cycle. Additiondly, 
adenovirus has a natural 
tropism for airway epithelia. 
Therefore, adenovirus-based 
vectors are particularly prefeired 
for respiratory gene therapy 
applications such as gene 
therapy for cystic fibrosis. 
In one embodiment, the 
adenovirus-based gene therapy 

vector COTiprises an adenovirus 

2 serotype genome in which 
the Ela and Elb regions of the 
genome, which are involved in 

early stages of viral replication have been deleted and replaced by genetic material of interest (e.g., DNA encoding the cystic fibrosis 
transmembrane regulator protein). In another embodiment, the adenovirus-based therapy vector b a pseudo-adodoviius (PAV). PAVs 
contain no potentially hamiful viral genes, have a theoretical capacity for foreign material of nearly 36 kb, may be produced in reasonably 
high titers and fnaintnin the tropism of the parent adenovirus for dividing and non-dividing human taiget cell types. 
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GENE THERAPY FOR CYSTIC FIBROSIS 



Related Applications 

This application is a continuation-in-part application of United States Serial Number 
5 08/130,682, filed on October 1, 1993 which is a continuation-in-part application of United 
States Serial Number 07/985,478, filed on December 2, 1992, which is a continuation-in-part 
application of United States Serial Number 07/613,592, filed on November 15, 1990, which 
is in turn a continuation-in-part application of United States Serial Number 07/589,295, filed 
on September 27, 1990, which is itself a continuation-in-part application of United States 

10 Serial Number 07/488,307, filed on March 5, 1990. The contents of all of the above co- 
pending patent applications are incorpomted herein by reference. Definitions of language or 
terms not provided in the present application are the same as those set forth in the copending 
applications. Any reagents or materials used in the examples of the present application 
whose source is not expressly identified also is the same as those described in the copending 

1 5 application, e.g., AF508 CFTR gene and CFTR antibodies. 

Ba^fkgroHinid of ftg InventjjQn 

Cystic Fibrosis (CF) is the most common fatal genetic disease in humans (Boat, T.F. 
et al. in The Metabolic Basis of Inherited Diseases (Scriver, C.R. et al. eds., McGraw-Hill, 

20 New York (1989)). Approximately one m every 2,500 infants in the United States is bom 
with the disease. At the present time, there are approximately 30,000 CF patients in the 
United States. Despite current standard therapy, the median age of survival is only 26 years. 
Disease of the pulmonary airways is the major cause of morbidity and is responsible for 95% 
of the mortality. The first manifestation of lung disease is often a cough, followed by 

25 progressive dyspnea. Tenacious sputum becomes purulent because of colonization of 

Staphylococcus and then with Pseudomonas. Chronic bronchitis and bronchiectasis can be 
partially treated with current therapy, but the course is punctuated by increasingly frequent 
exacerbations of the pulmonary disease. As the disease progresses, the patient's activity is 
progressively limited. End-stage lung disease is heralded by increasing hypoxemia, 

30 pulmonary hypertension, and cor pulmonale. 

The upper airways of the nose and sinuses are also involved in CF. Most patients 
with CF develop chronic sinusitis. Nasal polyps occur in 15-20% of patients and are 
common by the second decade of life. Gastrointestinal problems are also frequent in CF; 
infants may suffer meconiiun ileus. Exocrine pancreatic insufficiency, which produces 

35 symptoms of malabsorption, is present in the large majority of patients with CF, Males are 
almost xmiformly infertile and fertility is decreased in females. 

Based on both genetic and molecular analyses, a gene associated with CF was isolated 
as part of 21 individual cDNA clones and its protein product predicted (Kerem, B.S. et al. 
(1989) Science 245:1073-1080; Riordan, J.R. et al. (1989) Science 245:1066-1073; 
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Rommens, J.M, et al. (1989) Science 245:1059-1065)). United States Serial Number 
07/488,307 describes the construction of the gene into a continuous strand, expression of the 
gene as a functional protein and confirmation that mutations of the gene are responsible for 
CF. (See also Gregory, RJ. et al. (1990) Nature 347:382-386; Rich, D.P. et aL (1990) Nature 
5 347:358-362). The co-pending patent application also discloses experiments which show that . ♦ 
proteins expressed from wild type but not a mutant version of the cDNA complemented the 
defect in the cAMP regulated chloride channel shown previously to be characteristic of CF. • 

The protein product of the CF associated gene is called the cystic fibrosis 
transmembrane conductance regulator (CFTR) (Riordan, j.R. et al. (1989) Science 245:1066- 

10 1 073). CFTR is a protein of approximately 1480 amino acids made up of two repeated 
elements, each comprising six transmembrane segments and a nucleotide binding domain. 
The two repeats are separated by a large, polar, so-called R-domain containing multiple 
potential phosphorylation sites. Based on its predicted domain structure, CFTR is a member 
of a class of related proteins which includes the multi-drug resistance (MDR) or P- 

15 glycoprotein, bovine adenyl cyclase, the yeast STE6 protein as well as several bacterial 

amino acid transport proteins (Riordan, J.R. et al. (1989) Science 245:1066-1073; Hyde, S.C. 
et al. (1990) Nature 346:362-365). Proteins in this group, characteristically, are involved in 
pumping molecules into or out of cells. 

CFTR has been postulated to regulate the outward flow of anions from epithelial cells 

20 in response to phosphorylation by cyclic AMP-dependent protein kinase or protein kinase C 
(Riordan, J.R. et al. (1989) Science 245:1066-1073; Welsh, 1986; Frizzell, R.A. et al. (1986) 
Science 233:558-560; Welsh, M.J. and Liedtke, CM, (1986) Nature 322:467; Li, M. et al. 

(1988) Nature 331:358-360; Huang, T-C. et al. (1989) Science 244:1351-1353). 
Sequence analysis of the CFTR gene of CF chromosomes has revealed a variety of 

25 mutations (Cutting, G.R. et al. (1990) Nature 346:366-369; Dean, M. et al. (1990) Cell 

61 :863-870; and Kerem, B-S. et al. (1989) Science 245:1073-1080; Kerem, B-S. et al. (1990) 
Proc, Natl Acad Scl USA 87:8447-8451). Population studies have indicated that the most 
conmion CF mutation, a deletion of the 3 nucleotides that encode phenylalanine at position 
508 of the CFTR amino acid sequence (AF508), is associated with approximately 70% of the 
. 30 cases of cystic fibrosis. This mutation results in the failure of an epithelial cell chloride 

channel to respond to cAMP (Frizzell R.A. et al. (1986) Science 233:558-560; Welsh, M.J. 
(1986) Science 232:1648-1650.; Li, M. et al. (1988) Nature 331:358-360; Quinton, P.M. 

(1989) Clin, Chem, 35:726-730). In airway cells, this leads to an imbalance in ion and fluid 
transport. It is widely believed that this causes abnormal mucus secretion, and ultimately 

35 results in pulmonary infection and epithelial cell damage. 

Studies on the biosynthesis (Cheng, S.H. et al. (1990) Cell 63:827-834; Gregory, R.J. 
et al. (1991) M?/. Cell Biol 1 1 :3886-3893) and localization (Denning, G.M, et al, (1992) J. 
Cell Biol 1 18:551-559 ) of CFTR AF508, as well as other CFTR mutants, indicate that many 
CFTR mutant proteins are not processed correctly and, as a result, are not delivered to the 
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plasma membrane (Gregory, R.J. et al. (1991) MoL Cell Biol 1 1 :3886-3893). These 
conclusions are consistent with earlier functional studies which failed to detect cAMP- 
stimulated CI- channels in cells expressing CFTR AF508 (Rich, D.P. et al. (1990) Nature 
347:358-363; Anderson, M.P. et al. (1991) Science 251:679-682). 
5 To date, the primary objectives of treatment for CF have been to control infection, 

promote mucus clearance, and improve nutrition (Boat, T.F. et al. in The Metabolic Basis of 
Inherited Diseases (Scriver, C.R. et al. eds., McCh-aw-Hill, New York (1989)). Intensive 
antibiotic use and a program of postural drainage with chest percussion are the mainstays of 
therapy. However, as the disease progresses, frequent hospitalizations are reqmred. 

10 Nutritional reghnens mclude pancreatic enzymes and fat-soluble vitamins. Bronchodilators 
are used at times. Corticosteroids have been used to reduce inflammation, but they may 
produce significant adverse effects and their benefits are not certain. In extreme cases, lung 
transplantation is sometimes attempted (Marshall, S. et al. (1990) Chest 98:1488). 
Most efforts to develop new therapies for CF have focused on the pulmonary 

1 5 complications. Becaxise CF mucus consists of a high concentration of DNA, derived from 
lysed neutrophils, one approach has been to develop recombinant human DNase (Shak, S. et 
al. (1990) Proc. Natl. Sci. Acad USA 87:9188). Preliminary reports suggest that aerosolized 
enzyme may be effective in reducing the viscosity of mucus. This could be helpful in 
clearing the airways of obstruction and perhaps in reducing infections. In an attempt to limit 

20 damage caused by an excess of neutrophil derived elastase, protease inhibitors have been 

tested. For example, alpha- 1 -antitrypsin purified from human plasma has been aerosolized to 
deliver enzyme activity to lungs of CF patients (McElvaney, N. et al. (1991) The Lancet 
337:392). Another approach would be the use of agents to inhibit the action of oxidants 
derived from neutrophils. Although biochemical parameters have been successfully 

25 measured, the long term beneficial effects of these treatments have not been established. 

Using a different rationale, other investigators have attempted to use pharmacological 
agents to reverse the abnormally decreased chloride secretion and increased sodiimi 
absorption in CF airways. Defective electrolyte transport by airway epithelia is thought to 
alter the composition of the respiratory secretions and mucus (Boat, T.F. et al. in The 

30 Metabolic Basis of Inherited Diseases (Scriver, C.R. et al. eds., McGraw-Hill, New York 
(1989); Quinton, P.M. (1990) FASEBl 4:2709-2717). Hence, pharmacological treatments 
aimed at correcting the abnoraialities in electrolyte transport could be beneficial. Trials are in 
progress with aerosolized versions of the drug amiloride; amiloride is a diuretic that inhibits 
sodium channels, thereby inhibiting sodium absorption. Initial results indicate that the drug 

35 is safe and suggest a slight change in the rate of disease progression, as measured by lung 

fimction tests (Knowles, M. et al. (1990) K Eng, J. Med 322: 1 1 89-1 194; App, E.(1990) Am, 
Rev, Respir. Dis, 141 :605). Nucleotides, such as ATP or UTP, stimulate purinergic receptors 
in the airway epithelium. As a result, they open a class of chloride channel that is different 
from CFTR chloride channels. Jn vitro studies indicate that ATP and UTP can stimulate 
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chloride secretion (Knowles, M. et al. (1991) N, Eng. J. Med 325:533). Preliminary trials to 
test the ability of nucleotides to stimulate secretion in vivOy and thereby correct the electrolyte 
transport abnormalities are underway. 

Despite progress in therapy, cystic fibrosis remains a lethal disease, and no current 
5 therapy treats the basic defect However, two general approaches may prove feasible. These 
are: 1) protein replacement therapy to deliver the wild type protein to patients to augment 
their defective protein, and; 2) gene replacement therapy to deliver wild type copies of the CF 
associated gene. Since the most life threatening manifestations of CF mvolve pulmonary 
complications, epithelial cells of the upper airways are appropriate target cells for therapy. 

10 The feasibility of gene therapy has been established by introducing a wild type cDNA 

into epithelial cells from a CF patient and demonstrating complementation of the halhnark 
defect in chloride ion transport (Rich, D.P. et al. (1990) Nature 347:358-363 ). This initial 
work involved cells in tissue culture, however, subsequent work has shown that to deliver the 
gene to the airways of whole animals, defective adenovkuses may be useful (Rosenfeld, 

15 (1992) Cell 68:143-155). However, the safety and effectiveness of usmg defective 
adenoviruses remain to be demonstrated. 



In general, the instant invention relates to vectors for transferring selected genetic 

20 material of interest (e.g., DNA or RNA) to cells in vivo. In preferred embodiments, the 

vectors are adenovinis-based. Advantages of adenovirus-based vectors for gene therapy are 
that they appear to be relatively safe and can be manipulated to encode the desired gene 
product and at the same time are inactivated in terms of their ability to replicate in a normal 
lytic viral life cycle. Additionally, adenovirus has a natural tropism for airway epithelia. 

25 Therefore, adenovirus-based vectors are particularly preferred for respiratory gene therapy 
applications such as gene therapy for cystic fibrosis. 

In one embodiment, the adenovirus-based gene therapy vector comprises an 
adenovirus 2 serotype genome in which the Ela and Elb regions of the genome, which are 
involved in early stages of viral replication have been deleted and replaced by genetic 

30 material of interest (e.g., DNA encoding the cystic fibrosis transmembrane regulator protein). 

In another embodiment, the adenovirus-based therapy vector is a pseudo-adenovirus 
(PAV). PAVs contain no potentially harmful viral genes, have a theoretical capacity for 
foreign material of nearly 36 kb, may be produced in reasonably high titers and maintain the 
tropism of the parent adenovirus for dividing and non-dividing h\raian target cell types. 

35 PAVs comprise adenovirus inverted terminal repeats and the minimal sequences of a wild- 
type adenovirus type 2 genome necessary for efficient replication and packaging by a helper 
virus and genetic material of interest. In a preferred embodiment, the PAV contains 
adenovirus 2 sequences. 
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In a further embodiment, the adenovirus-based gene therapy vector contains the open 
reading frame 6 (0RF6) of adenoviral early region 4 (E4) from the E4 promoter and is 
deleted for all other E4 open reading frames. Optionally, this vector can include deletions in 
the El and/or E3 regions. Alternatively, the adenovirus-based gene therapy vector contains 
5 the open reading frame 3 (0RF3) of adenoviral E4 from the E4 promoter and is deleted for all 
other E4 open reading frames. Again, optionally, this vector can include deletions in the El 
and/or E3 regions. The deletion of non-essential open reading frames of E4 increases the 
cloning capacity by approximately 2 kb without significantly reducing the viability of the 
virus in cell culture. In combination with deletions in the El and/or E3 regions of adenovirus 
1 0 vectors, the theoretical insert capacity of the resultant vectors is increased to 8-9 kb. 

The invention also relates to methods of gene therapy using the disclosed vectors and 
genetically engineered cells produced by the method. 

BrirfPestnptioffi of the TaMfs upd Prawfnes 
1 5 Further understanding of the invention may be had by reference to the tables and 

figures wherein: 

Table I shows CFTR mutants wherein the known association with CF (Y, yes or N, 
no), exon localization, domain location and presence (+) or absence (-) of bands A, B, and C 
20 of mutant CFTR species is shown. TM6, indicates transmembrane domain 6; NBD 

nucleotide binding domain; ECD, extracellular domain and Term, termination at 21 codons 
past residue 1337; 

Table II shows the nucleotide sequence of Ad2/CFTR-1 ; 

25 

Table IH depicts a nucleotide analysis of Ad2-ORF6/PGK-CFTR; 

The convention for naming mutants is first the amino acid normally found at the 
particular residue, the residue number (Riordan, T.R. et al. (1989) Science 245:1066-1073). 
30 and the amino acid to which the residue was converted. The single letter amino acid code is 
used: D, aspartic acid; F, phenylalanine; G, glycine; I, isoleucine; K, lysine; M, methionine; 
N, asparagine; Q, glutamine; R, arginine; S, serine; W, tryptophan. Thus G551D is a mutant 
in which glycine 551 is converted to aspartic acid; 

35 Figure 1 shows alignment of CFTR partial cDNA clones used in construction of 

cDNA containing complete coding sequence of the CFTR, only restriction sites relevant to 
the DNA constructions described below are shown; 



Figure 2 depicts plasmid construction of the CFTR cDNA clone pKK-CFTRl; 
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Figure 3 depicts plasmid construction of the CFTR cDNA clone pKK-CFTR2; 
Figure 4 depicts plasmid construction of the CFTR cDNA clone pSC-CFTR2; 

5 

Figure 5 shows a plasmid map of the CFTR cDNA clone pSC-CFTR2; 

Figxire 6 shows the DNA sequence of synthetic DNAs used for insertion of an intron 
into the CFTR cDNA sequence, with the relevant restriction endonuclease sites and 
1 0 nucleotide positions noted; 



Figures 7A and 7B depict plasmid construction of the CFTR cDNA clone pKK- 
CFTR3; 

1 5 Figure 8 shows a plasmid map of the CFTR cDNA pKK-CFTR3 containing an intron 

between nucleotides 1716 and 1717; 

Figure 9 shows treatment of CFTR with glycosidases; 

20 Figures 1 OA and 1 OB show an analysis of CFTR expressed from COS-7 transfected 

cells; 



Figures 1 1 A and 1 IB show pulse-chase labeling of wild type and AF508 mutant 
CFTR in COS-7 transfected cells; 

25 

Figures 12A-12D show immxmolocalization of wild type and AF508 mutant CFTR; 
and COS-7 cells transfected with pMT-CFTR or pMT-CFTR-AF508; 

Figure 13 shows an analysis of mutant forms of CFTR; 

. 30 

Figure 14 shows a map of the first generation adenovirus based vector encoding 
CFTR(Ad2/CFTR-l); 



Figure 15 shows the plasmid construction of the Ad2/CFTR-1 vector; 

35 

Figure 16 shows an example of UV fluorescence from an agarose gel electrophoresis 
of products of nested RT-PCR from lung homogenates of cotton rats which received 
Ad2/CFTR-L The gel demonstrates that the homogenates were positive for virally-encoded 
CFTRmRNA; 
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Figure 17 shows an example of UV fluorescence from an agarose gel electrophoresis 
of products of nested RT-PCR from organ homogenates of cotton rats. The gel demonstrates 
that all organs of the infected rats were negative for Ad2/CFTR with the exception of the 
5 small bowel; 

Figures 1 8A and 18B show differential cell analyses of bronchoalveolar lavage 
specimens from control and infected rats. These data demonstrate that none of the rats 
treated with Ad2/CFTR-1 had a change in the total or differential white blood cell count 4, 
10 10, and 14 days after infection (Figure 18A) and 3, 7, and 14 days after infection (Figure 
18B); 

Figure 19 shows hematoxilyn and eosin stamed sections of cotton rat tracheas from 
both treated and control rats sacrificed at different time points after infection with 
1 5 Ad2/CFTR-1 . The sections demonstrate that there were no observable differences between 
the treated and control rats; 

Figures 20A and 20B show examples of UV fluorescence from an agarose gel ^ 
electrophoresis, stained with ethidium bromide, of products of RT-PCR from nasal brushings 
20 of Rhesus monkeys after application of Ad2/CFTR-1 or Ad2/p-Gal; 

Figure 21 shows lights microscopy and immxmocytochemistry from monkey nasal 
brushings. The microscopy revealed that there was a positive reaction when nasal epithelial 
cells from monkeys exposed to Ad2/CFTR-1 were stained with antibodies to CFTR; 

25 

Figure 22 shows inamunocytochemistry of monkey nasal turbinate biopsies. This 
microscopy reveals increased immunofluorescence at the apical membrane of the surface 
epithelium from biopsies obtained from monkeys treated with Ad2/CFTR-1 over that seen at 
the apical membrane of the surface epithelium from biopsies obtained from control monkeys; 

30 

Figures 23A-23D show serum antibody titers in Rhesus monkeys after three vector 
administrations. These graphs demonstrate that all three monkeys treated with Ad2/CFTR-1 
developed antibodies against adenovirus; 

35 Figure 24 shows hematoxilyn and eosin stained sections from monkey medial 

turbinate biopsies. These sections demonstrate that turbinate biopsy specimens from control 
monkeys could not be differentiated from those from monkeys treated with Ad2/CFTR-1 
when reviewed by an independent pathologist; 
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Figxires 25A-25I show photomicrographs of human nasal mucosa immediately before, 
during, and after Ad2/CFTR-1 application. These photomicrographs demonstrate that 
inspection of the nasal mucosa showed mild to moderate erythema, edema, and exudate in 
patients treated with Ad2/CFTR-1 (Figures 25A-25C) and in control patients (Figures 25G- 
5 251). These changes were probably due to local anesthesia and vasocontriction because when 
an additional patient was exposed to Ad2/CFTR in a method which did not require the use of 
local anesthesia or vasoconstriction, there were no symptoms and the nasal mucosa appeared 
noraial (Figures 25D-25F); 

1 0 Figure 26 shows a photomicrograph of a hematoxilyn and eosin stained biopsy of 

human nasal mucosa obtained from the third patient three days after Ad2/CFTR-1 
administration. This section shows a morphology consistent with CF, i.e., a thickened 
basement membrane and occasional morphonuclear cells in the submucosa, but no 
abnormalities that could be attributed to the adenovirus vector; 

15 

Figure 27 shows transepithelial voltage (V^) across the nasal epitheliimi of a normal 
human subject. Amiloride (|iM) and terbutaline (^M) were perfused onto the mucosal 
surface beginning at the times indicated. Under basal conditions (V^) was electrically 
negative. Perfusion of amiloride onto the mucosal surface inhibited (V^) by blocking apical 
20 Na"^ channels; 

Figures 28A and 28B show transepithelial voltage (Vt) across the nasal epithelium of 
normal human subjects (Figure 28A) and patients with CF (Figure 28B). Values were 
obtained under basal conditions, during perfiision with amiloride (|iM), and during perfiision 
25 of amiloride plus terbutaline (fiM) onto the mucosal surface. Data are from seven normal 
subjects and nine patients with CF. In patients with CF, (Vt) was more electrically negative 
than in normal subjects (Figure 28B). Amiloride inhibited (V^) in CF patients, as it did in 
normal subjects. However, V^ failed to hypeipolarize when terbutaline was perfiised onto the 
epithelium in the presence of amiloride. Instead, (Vt) either did not change or became less 
. 30 negative, a result very different from that observed in normal subjects; 

Figures 29A and 29B show transepithelial voltage (Vt) across the nasal epithelium of 
a third patient before (Figure 29A) and after (Figure 29B) administration of approximately 
25 MOI of Ad2/CFTR-1 . Amiloride and terbutaline were perfiised onto the mucosal surface 
35 beginning at the times indicated. Figure 29 A shows an example from the third patient before 
treatment. Figure 29B shows that in contrast to the response before Ad2/CFTR-1 was 
applied, after virus replication, in the presence of amiloride, terbutaline stimulated Vt; 
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Figures 30A-30F show the time of course changes in transepithelial electrical 
properties before and after administration of Ad2/CFTR-1. Figures 30A and 30B are from 
the first patient who received approximately 1 MOI; Figures 30C and 30D are from the 
second patient who received approximately 3 MOI; and Figures 30E and 30F are from the 
5 third patient who received approximately 25 MOI. Figures 30A, 30C, and 30E show values 
of basal transeptithelial voltage (Vt) and Figures 30B, 30D, and 30F show the change in 
transepithelial voltage (AVf) following perfusion of terbutaline in the presence of amiloride. 
Day zero indicates the day of Ad2/CFTR-1 administration. Figures 30A, 30C, and 30E show 
the time course of changes in basal for all three patients. The decrease in basal Vt 
1 0 suggests that application of Ad2/CFTR-1 corrected the CF electrolyte transport defect m 
nasal epithelium of all three patients. Additional evidence came from an examination of the 
response to terbutalme. Figures 30B, 30D, and 30F show the time course of the response. 
These data indicate that Ad2/CFTR-1 corrected the CF defect in CI" transport; 

1 5 Figure 3 1 shows the tune course of changes in transepithelial electrical properties 

before and after administration of salme instead of Ad2/CFTR-1 to CF patients. Day zero 
indicates the time of mock administration. The top graph shows basal transepithelial voltage 
(Vt) and the bottom graph shows the change in transepithelial voltage following perfusion 
with terbutaline m the presence of amiloride (AVt). Closed symbols are data from two 

20 patients that received local anesthetic/vasoconstriction and placement of the applicator for 
thirty minutes. Open symbol is data from a patient that received local 
anesthetic/vasoconstriction, but not placement of the applicator. Symptomatic changes and 
physical findings were the same as those observed in CF patients treated with a similar 
administration procedure and Ad2/CFTR-1; 

25 

Figure 32 shows a map of the second generation adenovirus based vector, PAV; 

Figure 33 shows the plasmid construction of a second generation adenoviral vector 6 
(AdE4 0RF6); 

• 30 

Figure 34 is a schematic of Ad2-ORF6/PGK-CFTR which differs from Ad2/CFTR in 
that the latter utilized the endogenous Ela promoter, had no poly A addition signal directly 
downstream of CFTR and retamed an intact E4 region; 

35 Figure 35 shows short-circuit currents from human CF nasal polyp epithelial cells 

mfected with Ad2-ORF6/PGK-CFTR at multiplicities of 0.3, 3, and 50. At the indicated 
times: (1) 10 ^M amiloride, (2) cAMP agonists (10 forskolin and 100 |iM IBMX, and (3) 
1 mM diphenylamine-2-carboxylate were added to the mucosal solution; 
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Figiires 36A-36D show inununocytochemistry of nasal brushings by laser scanning 
microscopy of the Rhesus monkey C, before infection (36A) and on 7 days (36B); 24 (36C); 
and 38 (36D) after the first infection with Ad2-ORF6/PGK-CFTR; 

Figures 37A-37D show immunocytochemistry of nasal brushings by laser scanning 
microscopy of Rhesus monkey D, before infection (37A) and on days 7 (37B); 24 (37C); and 
48 (37D) after the first infection with Ad2-ORF6/PGK-CFTR; 

Figures 38A-38D show immunocytochemistry of nasal brushings by laser scanning 
microscopy of the Rhesus monkey E, before infection (38A) and on days 7 (38B); 24 (38C); 
and 48 (38D) after the first infection with Ad2-ORF6/PGK-CFTR; 

Figures 39A-39C show surmnaries of the clinical signs (or lack thereof) of infection 
with Ad2-ORF6/PGK-CFTR; 

Figures 40A-40C shows a summary of blood counts, sedimentation rate, and clinical 
chemistries after infection with Ad2-ORF6/PGK-CFTR for monkeys C, D, and E. There was 
no evidence of a systemic inflammatory response or other abnormalities of the clinical 
chemistries; 

Figure 41 shows summaries of white blood cells counts in monkeys C, D, and E after ' 
infection with Ad2-ORF6/PGK-CFTR. These date indictate that the administration of Ad2- 
0RF6/PGK-CFTR caused no change in the distribution and number of inflammatory cells at 
any of the time points following viral administration; 

Figure 42 shows histology of submucosal biopsy performed on Rhesus monkey C on 
day 4 after the second viral instillation of Ad2-ORF6/PGK-CFTR. Hematoxylin and eosin 
stain revealed no evidence of inflammation or cytopathic changes; 

Figure 43 shows histology of submucosal biopsy performed on Rhesus monkey D on 
day 1 1 after the second viral instillation of Ad2-ORF6/PGK-CFTR. Hematoxylin and eosin 
stain revealed no evidence of inflammation or cytopathic changes; 

Figure 44 shows histology of submucosal biopsy performed on Rhesus monkey E on 
day 1 8 after the second viral instillation of Ad2-ORF6/PGK-CFTR. Hematoxylin and eosin 
stain revealed no evidence of inflammation or cytopathic changes; and 
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Figures 45 A-4SC show antibody titers to adenovirus prior to and after the first and second 
administrations of Ad2-ORF6/PGK-CFrR. Prior to administration of Ad2-ORF6/PGK- 
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CFTR, the monkeys had received instillations of Ad2/CFTR-1 . Antibody titers measured by 
ELISA rose within one week after the first and second administrations of Ad2-ORF6/PGK« 
CFTR. Serum neutralizing antibodies also rose within a week after viral administration and 
peaked at day 24. No anti-adenoviral antibodies were detected by ELISA or neutralizing 
assay in nasal washings of any of the monkeys. 

Detailed Description and Best Mode 

Qgng Therapy 

As used herein, the phrase "gene therapy" refers to the transfer of genetic material 
(e.g., DNA or RNA) of interest into a host to treat or prevent a genetic or acquired disease or 
condition. The genetic material of interest encodes a product (e.g., a protein polypeptide, 
peptide or fimctional RNA) whose production in vivo is desired. For example, the genetic 
material of interest can encode a hormone, receptor, enzyme or (poly) peptide of therapeutic 
value. Examples of genetic material of interest include DNA encoding: the cystic fibrosis 
transmembrane regulator (CFTR), Factor VIII, low density lipoprotein receptor, beta- 
galactosidase, alpha-galactosidase, beta-glucocerebrosidase, insulin, parathyroid hormone, 
and alpha- 1 -antitrypsin. 

Although the potential for gene therapy to treat genetic diseases has been appreciated 
for many years, it is only recently that such approaches have become practical with the 
treatment of two patients with adenosine deamidase deficiency. The protocol consists of 
removing lymphocytes fi-om the patients, stimulating them to grow in tissue culture, infecting 
them with an appropriately engineered retrovirus followed by reintroduction of the cells into 
the patient (Kantoff, P. et al. (1987) J, Exp. Med, 166:219). Initial results of treatment are 
very encouraging. With the approval of a number of other human gene therapy protocols for 
limited clinical use, and with the demonstration of the feasibility of complementing the CF 
defect by gene transfer, gene therapy for CF appears a very viable option. 

The concept of gene replacement therapy for cystic fibrosis is very simple; a 
preparation of CFTR coding sequences in some suitable vector in a viral or other carrier 
delivered directly to the airways of CF patients. Since disease of the pulmonary airways is 
the major cause of morbidity and is responsible for 95% of mortality, airway epithelial cells 
are preferred target cells for CF gene therapy. The first generation of CF gene therapy is 
likely to be transient and to require repeated delivery to the airways. Eventually, however, . 
gene therapy may offer a cure for CF when the identity of the precursor or stem cell to air 
epithelial cells becomes known. If DNA were incorporated into airway stem cells, all 
subsequent generations of such cells would make authentic CFTR firom the integrated 
sequences and would correct the physiological defect aknost irrespective of the biochemical 
basis of the action of CFTR. 
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Although simple in concept, scientific and clinical problems face approaches to gene 
therapy, not least of these being that CF requires an in vivo approach ^iiile all gene therapy 
treatments in humans to date have involved ex vivo treatment of cells taken from the patient 
followed by reintroduction, 
5 One major obstacle to be overcome before gene therapy becomes a viable treatment 

approach for CF is the development of appropriate vectors to infect tissue manifesting the 
disease and deliver the therapeutic CFTR. gene. Since viruses have evolved very efficient 
means to introduce their nucleic acid into cells, many approaches to gene therapy make use of 
engineered defective virases. However, the use of viruses in vivo raises safety concerns. 
1 0 Although potentially safer, the use of simple DNA plasmid constructs containing miiumal 
additional DNA, on the other hand, is often very inefficient and can resuU in transient protein 
expression. 

The integration of introduced DNA into the host chromosome has advantages in that 
such DNA will be passed to daughter cells. In some circumstances, integrated DNA may 

1 5 also lead to high or more sustained expression. However, mtegration often, perhaps always, 
reqmres cellular DNA replication in order to occur. This is certainly the case with the present 
generation of retroviruses. This Umits the use of such viruses to circumstances where cell 
division occurs in a high proportion of cells. For cells cultured in vitro j this is seldom a 
problem, however, the cells of the airway are reported to divide only infrequently 

20 (Kawanami, O. et al. (1979) An. Rev, Respir. Dis. 120:595). The use of retroviruses in CF 
vnW probably require damaging the airways (by agents such as SO2 or O3) to induce cell 
division. This may prove impracticable in CF patients. 

Even if efficient DNA integration could be achieved using vuiises, the human genome 
contains elements involved in the regulation of cellular growth only a small fraction of which 

25 are presently identified. By integrating adjacent to an element such as a proto-oncogene or an 
anti-oncogene, activation or inactivation of that element could occur leading to uncontrolled 
growth of the altered cell. It is considered likely that several such activation/inactivation 
steps are usually required in any one cell to induce uncontrolled proliferation (R.A. Weinberg 
(1989) Cancer Research 49:3713 ), which may reduce somewhat the potential risk. On the 

30 other hand, insertional mutagenesis leading to tumor formation is certainly known in animals 
with some nondefective retroviruses (R.A. Weinberg, supra; Payne, G.S. et al. (1982) Nature 
295:209), and the large numbers of potential integrations occurring during the lifetime of a 
patient treated repeatedly in vivo with retroviruses must raise concerns on the safety of such a 
procedure. 

35 In addition to the potential problems associated with viral DNA integration, a number 

of additional safety issues arise. Many patients may have preexisting antibodies to some of 
the viruses that are candidates for vectors, for example, adenoviruses. In addition, repeated 
use of such vectors might induce an immune response. The use of defective viral vectors 
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may alleviate this problem somewhat, because the vec .ors will not lead to productive viral 
life cycles generating infected cells, cell lysis or large aumbers of progeny viruses. 

Other issues associated with the use of viruses are the possibility of recombination 
with related viruses naturally infecting the treated patient, complementation of the viral 
5 defects by simultaneous expression of wild type virus proteins and containment of aerosols of 
the engineered viruses. 

Gene therapy approaches to CF will face many of the same clinical challenges at 
protein therapy. These include the inaccessibility of airway epithelium caused by mucus 
build-up and the hostile nature of the environment in CF akways which may inactivate 
1 0 viruses/vectors. Elements of the vector carriers may be immunogenic and introduction of the 
DNA may be inefficient. These problems, as with protein therapy, are exacerbated by the 
absence of a good animal model for the disease nor a simple clinical end point to measure the 
efficacy of treatment. 

15 CF Gene Ther apv Vectors - Possible Options 

Retroviruses - Although defective retroviruses are the best characterized system and 
so far the only one approved for use in hximan gene therapy (Miller, A.D. (1990) Blood 
76:271), the major issue in relation to CF is the requirement for dividing cells to achieve 
20 DNA integration and gene expression. Were conditions found to induce airway cell division, 
the in vivo application of retroviruses, especially if repeated over many years, would 
necessitate assessment of the safety aspects of insertional mutagenesis in this context. 

Adeno-Associated Virus - (AAV) is a naturally occurring defective virus that requires 
25 other viruses such as adenoviruses or herpes viruses as helper viruses(Muzyczka, N. (1 992) in 
Current Topics in Microbiology and Immunology 158:97). It is also one of the few viruses 
that may integrate its DNA into non-dividing cells, although this is not yet certain. Vectors 
containing as little as 300 base pairs of AAV can be packaged and can integrate, but space for 
exogenous DNA is limited to about 4.5 kb. CFTR DNA may be towards the upper limit of 
30 packaging. Furthermore, the packaging process itself is presently inefficient and safety issues 
such as immunogenecity, complementation and containment will also apply to AAV. 
Nevertheless, this system is sufficiently promising to warrant further study. 

Plasmid DNA - Naked plasmid can be introduced into muscle cells by injection into 
35 the tissue. Expression can extend over many months but the number of positive cells is low 
(Wolff, J. et al. (1989) Science 247:1465). Cationic lipids aid introduction of DNA into some 
cells in culture (Feigner, P. and Ringold, G.M. (1989) Nature 33T3%iy Injection of cationic 
lipid plasmid DNA complexes into the circulation of mice has been shown to result in 
expression of the DNA in lung (Brigham, K. et al. (1989) Am. J, Med Set 298:278). 
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Instillation of cationic lipid plasmid DNA into lung also leads to expression in epithelial cells 
but the efficiency of expression is relatively low and transient (Hazinski, T.A. et al. (1991) 
Am. J. Respir, Cell Mol Biol 4:206). One advantage of the use of plasmid DNA is that it 
can be introduced into non-replicating cells. However, the use of plasmid DNA in the CF 
5 airway environment, which already contains high concentrations of endogenous DNA may be 
problematic. 

Receptor Mediated Entrv - In an effort to improve the efficiency of plasmid DNA 
uptake, attempts have been made to utilize receptor-mediated endocytosis as an entry 

10 mechanisms and to protect DNA in complexes with polylysine (Wu, G, and Wu, C.H. (1988) 
J. Biol Chem. 263:14621). One potential problem with this approach is that the incommg 
plasmid DNA enters the pathway leading from endosome to lysosome, where much incoming 
material is degraded. One solution to this problem is the use of transferrin DNA-polylysine 
complexes linked to adenovirus capsids (Curiel, D.T. et al. (1991) Proc. Natl Acad Sci. USA 

1 5 88:8850). The latter enter efficiently but have the added advantage of naturally disrupting the 
endosome thereby avoiding shuttling to the lysosome. This approach has promise but at 
present is relatively transient and suffers from the same potential problems of 
immunogenicity as other adenovirus based methods. 



20 Adenovirus - Defective adenoviruses at present appear to be a promising approach to 

CF gene therapy (Berkner, K.L. (1988) BioTechniques 6:616). Adenovirus can be 
manipulated such that it encodes and expresses the desired gene product, (e.g., CFTR), and at 
the same time is inactivated in terms of its ability to replicate in a normal lytic viral life cycle. 
In addition, adenovirus has a natural tropism for airway epithelia. The viruses are able to 

25 infect quiescent cells as are foimd in the airways, offering a major advantage over 

retroviruses. Adenovirus expression is achieved without integration of the viral DNA into the 
host cell chromosome, thereby alleviating concerns about insertional mutagenesis. 
Furthermore, adenoviruses have been xxsed as live enteric vaccines for many years with an 
excellent safety profile (Schwartz, A.R. et al. (1974) Am. Rev. Respir. Dis. 109:233-238). 

30 Finally, adenovirus mediated gene transfer has been demonstrated in a number of instances 
including transfer of alpha- 1 -antitrypsin and CFTR to the lungs of cotton rats (Rosenfeld, 
M.A. et al. (1991) Science 252:431-434; Rosenfeld et al., (1992) Cell 68:143-155). 
Furthermore, extensive studies to attempt to establish adenovirus as a causative agent in 
human cancer were uniformly negative (Green, M. et al. (1979) Proc. Natl Acad. Set USA 

35 76:6606). 

The following properties would be desirable in the design of an adenovirus vector to 
transfer the gene for CFTR to the airway cells of a CF patient. The vector should allow 
sufficient expression of the CFTR, while producing minimal viral gene expression. There 
should be minimal viral DNA replication and ideally no virus replication. Fmally, 
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recombination to produce new viral sequences and complementation to allow growth of the 
defective virus in the patient should be minimized. A first generation adenovirus vector 
encoding CFTR (Ad2/CFTR), made as described in the following Example 7, achieves most 
of these goals and was used in the human trials described in Example 1 0. 
5 Figure 14 shows a map of Ad2/CFTR-L As can be seen from the figure, this first . 

generation virus includes viral DNA derived from the common relatively benign adenovirus 2 
serotype. The Ela and Elb regions of the viral genome, which are involved in early stages of 
viral replication have been deleted. Their removal impairs viral gene expression and viral 
replication. The protein products of these genes also have immortalizing and transforming 

1 0 fimction in some non-permissive cells. 

The CFTR coding sequence is inserted into the viral genome in place of the Ela/Elb 
region and transcription of the CFTR sequence is driven by the endogenous Ela promoter. 
This is a moderately strong promoter that is fiinctional in a variety of cells. In contrast to 
some adenovirus vectors (Rosenfeld, M. et al. (1992) Cell 68:143), this adenovirus retains 

15 the E3 viral coding region. As a consequence of the inclusion of E3, the length of the 

adenovirus-CFTR DNA is greater than that of the wild-type adenovirus. The greater length 
of the recombinant viral DNA renders it more difficult to package. This means that the 
growth of the Ad2/CFTR virus is impaired even in permissive cells that provide the missing 
Ela and Elb fimctions. 

20 The E3 region of the Ad2/CFTR-1 encodes a variety of protems. One of these 

proteins, gpl9, is believed to interact with and prevent presentation of class 1 proteins of the 
major histocompatability complex (MHC) (Gooding, C.R. and Wold, W.S.M. (1990) Crit 
Rev, Immunol 10:53). This property prevents recognition of the infected cells and thus may 
allow viral latency. The presence of E3 sequences, therefore, has two usefiil attributes; first, 

25 the large size of the viral DNA renders it doubly defective for replication (i.e., it lacks early 
fimctions and is packaged poorly) and second, the absence of MHC presentation could be 
usefiil in later applications of Ad2/CFTR-1 m gene therapy involving multiple 
administrations because it may avoid an immune response to recombinant virus containing 
cells. 

, 30 Not only are there advantages associated with the presence of E3 ; there may be 

disadvantages associated vdth its absence. Studies of E3 deleted virus in animals have 
suggested that they result in a more severe pathology (Gingsberg, H.S. et al. (1989) Proc. 
Natl Acad ScL (USA) 86:3823). Furthermore, E3 deleted virus, such as might be obtained 
by recombination of an El plus E3 deleted virus with wild-type virus, is reported to outgrow 
35 wild-type in tissue culture (Barkner, K.L. and Sharp, P. (1 983) Nucleic Acids Research 
1 1 :6003). By contrast, however, a recent report of an E3 replacement vector encoding 
hepatitis B surface antigen, suggests that when delivered as a live enteric vaccine, such a 
virus replicates poorly in human compared to wild-type. 
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The adenoviras vector.(Ad2/CFTR-l) and a related virus encoding the marker p- 
galactosidase (Ad2/p-gal) have been constructed and grown in human 293 cells. These cells 
contain the El region of adenovirus and constitutively express Ela and Elb, which 
complement the defective adenoviruses by providing the products of the genes deleted from 
5 the vector. Because the size ofits genome is greater than that of wild-type virus, Ad2/CFTR 
is relatively diffictilt to produce. 

The Ad2/CFTR-1 virus has been shown to encode CFTR by demonstrating the 
presence of the protein in 293 cells. The Ad2/p-gal virus was shown to produce its protein in 
a variety of cell lines grown in tissue culture including a monkey bronchiolar cell line 
1 0 (4MBR-5), primary hamster tracheal epithelial cells, human HeLa, human CP PAC cells (see 
Example 8) and airway epithelial cells from CP patients (Rich, O. et al. (1990) Nature 
347:358). 

Ad2/CFTR-1 is constructed from adenovirus 2 (Ad2) DNA sequences. Other 
varieties of adenovirus (e.g., Ad3, Ad5, and Ad7) may also prove useful as gene therapy 
1 5 vectors. This may prove essential if immune response against a single serotype reduces the 
effectiveness of the therapy. 

Second Generation Adenoviral Vectors 

Adenoviral vectors currently in use retain most (> 80%) of the parental viral genetic 
20 material leaving then: safety untested and in doubt. Second-generation vector systems 
containing minimal adenoviral regulatory, packaging and replication sequences have 
therefore been developed. 

Pseudo-Adenovirus Vectors nPAVV PAVs contain adenovirus inverted terminal 
25 repeats and the minimal adenovirus 5' sequences required for helper virus dependent 

replication and packaging of the vector. These vectors contain no potentially harmful viral 
genes, have a theoretical capacity for foreign material of nearly 36 kb, may be produced in 
reasonably high titers and maintain the tropism of the parent virus for dividing and non- 
dividing human target cell types. 
30 The PAV vector can be maintamed as either a plasmid-bome construct or as an 

infectious viral particle. As a plasmid construct, PAV is composed of the minimal sequences 
from wild type adenovirus type 2 necessary for efficient replication and packaging of these 
sequences and any desired additional exogenous genetic material, by either a wild-type or 
defective helper virus. 

35 Specifically, PAV contains adenovirus 2 (Ad2) sequences as shown in Figure 17, 

from nucleotide (nt) 0-356 forming the 5' end of the vector and the last 109 nt of Ad2 
forming the 3' end of the construct. The sequences includes the Ad2 flanking inverted 
terminal repeats (5'ITR) and the 5* ITR adjoining sequences containing the known packaging 
signal and Ela enhancer. Various convenient restriction sites have been incorporated into the 
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j&^gments, allowing the insertion of promoter/gene cassettes which can be packaged in the 
PAV virion and used for gene transfer (e.g. for gene therapy). The construction and 
propagation of PAV is described in detail in the following Example 1 1. By not containing 
most native adenoviral DNA, the PAVs described herein are less likely to produce a patient 
S immune reponse or to replicate in a host. 

In addition, the PAV vectors can accomodate foreign DNA up to a maximum length 
of nearly 36 kb. The PAV vectors therefore, are especially useful for cloning larger genes 
(e.g., CFTR (7.5 kb)); Factor VIII (8 kb); Factor IX (9 kb)), which, traditional vectors have 
difficulty accomodating. In addition, PAV vectors can be used to transfer more than one 
10 gene, or more than one copy of a particular gene. For example, for gene therapy of cystic 
fibrosis, PAVs can be used to deliver CFTR in conjunction with other genes such as anti 
proteases (e.g., antiprotease alpha-1 -antitrypsin) tissue inhibitor of metaloproteinase, 
antioxidants (e.g., superoxide dismutase), enhancers of local host defense (e.g., interferons), 
mucolytics (e.g., DNase); and proteins which block inflammatory cytokines. 

15 

Ad2-P4/ORr6 Agenpyirus Ygctprg 

An adenoviral construct expressing only the open reading fi-ame 6 (0RF6) of 
adenoviral early region 4 (E4) firom the E4 promoter and which is deleted for all other known 
E4 open reading firames was constructed as described in detail in Example 12. Expression of 

20 E4 open reading frame 3 is also sufficient to provide E4 functions required for DNA 

replication and late protein synthesis. However, it provides these functions with reduced 
efficiency compared to expression of 0RF6, which will likely result in lower levels of virus 
production. Therefore expressing 0RF6, rather than ORF3, appears to be a better choice for 
producing recombinant adenovirus vectors. 

25 The E4 region of adenovirus is suspected to have a role in viral DNA replication, late 

mRNA synthesis and host protein synthesis shut off, as well as in viral assembly (Falgout, B. 
and G. Ketner (1987) 1 Virol 61:3759-3768), Adenovirus early region 4 is required for 
efficient virus particle assembly. Adenovirus early region 4 encodes functions required for 
efficient DNA replication, late gene expression, and host cell shutoff. Halbert, D.N, et al. 

30 (1985)7. Firo/. 56:250-257. 

The deletion of non-essential open reading frames of E4 increases the cloning 
capacity of recombinant adenovirus vectors by approximately 2 kb of insert DNA without 
significantly reducing the viability of the virus in cell culture. When placed in combination 
with deletions in the El and/or E3 regions of adenovirus vectors, the theoretical insert 

35 capacity of the resultant vectors is increased to 8-9 kb. An example of where this increased 
cloning capacity may prove useful is in the development of a gene therapy vector encoding 
CFTR. As described above, the first generation adenoviral vector approaches the maximiun 
packaging capacity for viral DNA encapsidation. As a result, this virus grows poorly and 
may occassionaly give rise to defective progeny. Including an E4 deletion in the adenovirus 
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vector should alleviate these problems. In addition, it allows flexibility in the choice of 
promoters to drive CFTR expression from the virus. For example, strong promoters such as 
the adenovirus major late promoter, the cytomegalovirus immediate early promoter or a 
cellular promoter such as the CFTR promoter, which may be too large for first-generation 
S adenovirus can be used to drive expression. 

In addition, by expressing only 0RF6 of E4, these second generation adenoviral 
vectors may be safer for use in gene therapy. Although 0RF6 expression is sufficient for 
vural DNA replication and late protein synthesis in immortalized cells, it has been suggested 
that ORF6/7 of E4 may also be required in non-dividing primary cells (Hemstrom, C. et al. 

10 (1991) J. Virol 65:1440-1449). The 19 kD protein produced from open reading frame 6 and 
7 (ORF6/7) complexes with and activates cellular transcription factor E2F, which is required 
for maximal activation of early region 2. Early region 2 encodes proteins required for viral 
DNA replication. Activated transcription factor E2F is present in proliferating cells and is 
involved in the expression of genes required for cell proliferation (e.g., DHFR, c-myc), 

1 5 whereas activated E2F is present in lower levels in non-proliferating cells. Therefore, the 
expression of only 0RF6 of E4 should allow the virus to replicate normally in tissue culture 
cells (e.g.,.293 cells), but the absence of ORF6/7 would prevent the potential activation of 
transcription factor E2F in non-dividing primary celUs and thereby reduce the potential for 
viral DNA replication. 

20 

Targgt Tim9 

Because 95% of CF patients die of lung disease, the lung is a preferred target for gene 
therapy. The halbnark abnormality of the disease is defective electrolyte transport by the 
epithelial cells that line the airways. Numerous investigators (reviewed in Quinton, F. (1990) 

25 FASEB J. 4:2709) have observed: a) a complete loss of cAMP-mediated transepithelial 
chloride secretion, and b) a two to three fold increase in the rate of Na+ absorption. cAMP- 
stimulated chloride secretion requires a chloride channel in the apical membrane (Welsh, M. J. 
(1987) Physiol Rev, 67:1 143-1 184), The discovery that CFTR is a phosphorylation-regulated 
chloride channel and that the properties of the CFTR chloride channel are the same as those 

30 of the chloride channels in the apical membrane, indicate that CFTR itself mediates 

transepithelial chloride secretion. This conclusion was supported by studies localizing CFTR 
in lung tissue: CFTR is located in the apical membrane of airway epithelial cells (Denning, 
G.M.etaL (1992)J. CellBiol 118:551) and has been reported to be present in the 
submucosal glands (Taussig et al., (1973) 1 Clin, Invest, 89:339). As a consequence of loss 

35 of CFTR function, there is a loss of cAMP-regulated transepithelial chloride secretion. At 
this time it is uncertain how dysfunction of CFTR produces an increase in the rate of Na+ 
' absorption. However, it is thought that the defective chloride secretion and increased Na+ 
absorption lead to an alteration of the respiratory tract fluid and hence, to defective 
mucociliary clearance, a normal pulmonary defense mechanism. As a result, clearance of 
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inhaled material from the lung is impaired and repeated infections ensue. Although the 
presumed abnormalities in respiratory tract fluid and mucociliary clearance provide a 
plausible explanation for the disease, a precise understanding of the pathogenesis is still 
lacking. 

5 Correction of the genetic defect in the airway epithelial cells is likely to reverse the 

CF pulmonary phenotype. The identity of the specific cells in the airway epithelium that 
express CFTR cannot be accurately determined by immunocytochemical means, becaxise of ^ 
the low abundance of protein. However, functional studies suggest that the ciliated epithelial 
cells and perhaps nonciliated cells of the surface epithelium are among the main cell types 

10 involved in electrolyte transport. Thus, in practical terms, the present preferred target cell for 
gene therapy would appear to be the mature cells that line the puknonary airways. These are 
not rapidly dividing cells; rather, most of them are nonproliferating and many may be 
terminally differentiated. The identification of the progenitor cells in the airway is uncertain. 
Although CFTR may also be present in submucosal glands (Trezise, A.E. and Buchwald, M. 

15 (1991) Nature 353:434; Englehardt, J.F. et al. (1992) J. Clin. Invest 90:2598-2607), there is 
no data as to its function at that site; furthermore, such glands appear to be relatively 
inaccessible. 

The airway epithelium provides two main advantages for gene therapy. First, access 
to the airway epithelium can be relatively noninvasive. This is a significant advantage in the 
20 development of delivery strategies and it will allow investigators to monitor the therapeutic 
response. Second, the epithelium forms a barrier between the airway lumen and the 
interstitium. Thus, application of the vector to the lumen will allow access to the target cell 
yet, at least to some extent, limit movement through the epithelial barrier to the interstitium 
and from there to the rest of the body. 

25 

Efficiencv of Gene Delivery Re quired to Correct The Genetic Defect 

It is unlikely that any gene therapy protocol will correct 100% of the cells that 
normally express CFTR. However, several observations suggest that correction of a small 
percent of the involved cells or expression of a fraction of the normal amount of CFTR may 
. 30 be of therapeutic benefit. 

a. CF is an autosomal recessive disease and heterozygotes have no lung disease. 
Thus, 50% of wild-type CFTR would appear sufficient for normal function. 

35 b. This issue was tested in mixing experiments using CF cells and recombinant 

CF cells expressing wild-type CFTR (Johnson, L.G. et al. (1992) Nature Gen, 2:21). The 
data obtained showed that when an epithelium is reconstituted with as few as 6-10% of 
corrected cells, chloride secretion is comparable to that observed with an epithelium 
containing 100% corrected cells. Although CFTR expression in the recombinant cells is 
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probably higher than in normal cells, this result suggests that in vivo correction of all CF 
airway cells may not be required. 

c. Recent observations show that CFTR containing some CF-associated 
5 mutations retains residual chloride channel activity (Sheppard, D.N. et al. (1992) Pediatr, 
Pulmon SuppL 8:250; Strong, T.V. et al, (1991) N. Eng. J. Med 325: 1630), These mutations 
are associated with mild lung disease. Thus, even a very low level of CFTR activity may at 
least partly ameliorate the electrolyte transport abnormalities. 

10 d. As indicated in experiments described below in Example 8, complementation 

of CF epitheUa, under conditions that probably would not cause expression of CFTR in every 
cell, restored cAMP stimulated chloride secretion. 

e. Levels of CFTR in noraial human airway epithelia are very low and are barely 
1 5 detectable. It has not been detected using routine biochemical techniques such as 

immunoprecipitation or immunoblotting and has been exceedingly difficuh to detect with 
immunocytochemical techniques (Denning, G.M. et al. (1992) J. Cell Biol 1 18:551). 
Although CFTR has been detected in some cases using laser-scanning confocal microscopy, 
the signal is at the limits of detection and cannot be detected above background in every case, 
20 Despite that minimal levels of CFTR, this small amount is sufficient to generate substantial 
cAMP-stimulated chloride secretion. The reason that a very small number of CFTR chloride 
channels can support a large chloride secretory rate is that a large number of ions can pass 
through a smgle channel (lO^-lO^ ions/sec) (Hille, B, (1984) Sinauer Assoc. Inc., 
Sunderland, MA 420-426). 

25 

f Previous studies using quantitative PCR have reported that the airway 
epithelial cells contain at most one to two transcripts per cell (Trapnell, B.C. et al. (1991) 
Proc. Natl Acad Sci. USA 88:6565). 

. 30 Gene therapy for CF would appear to have a wide therapeutic index. Just as partial 

expression may be of therapeutic value, overexpression of wild-type CFTR appears unlikely 
to cause significant problems. This conclusion is based on both theoretical considerations 
and experimental results. Because CFTR is a regulated channel, and because it has a specific 
function in epithelia, it is unlikely that overexpression of CFTR will lead to uncontrolled 
35 chloride secretion. First, secretion would require activation of CFTR by cAMP-dependent 
phosphorylation. Activation of this kinase is a highly regulated process. Second, even if 
CFTR chloride channels open in the apical membrane, secretion will not ensue v^thout 
regulation of the basolateral membrane transporters that are requured for chloride to enter the 
cell from the interstitial space. At the basolateral membrane, the sodium-potassium-chloride 
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cotransporter and potassium channels serve as important regulators of transeptihelial 
secretion (Welsh, M.J. (mi) Physiol Rev. 67:1143-1184). 

Human CFTR has been expressed in transgenic mice under the control of the 
surfactant protein C(SPC) gene promoter (Whitesett, J.A. et al. (1992) Nature Gen. 2:13) and 
5 the casein promoter (Ditullio, P. et al (1 992) Bio/Technology 1 0:74 ). In those mice, CFTR 
was overexpressed in bronchiolar and alveolar epithelial cells and in the mammary glands, 
respectively. Yet despite the massive overexpression in the traxisgenic animals, there were no 
observable morphologic or functional abnormalities. In addition, expression of CFTR in the 
lungs of cotton rats produced no reported abnomialities (Rosenfeld, MA. et al. (1992) Cell 
10 68:143-155). 

The present invention is further illustrated by the foUowuig examples which in no 
way should be construed as being further limiting. The contents of all cited references 
(including literature references, issued patents, published patent applications, and co-pending 
patent applications) cited throughout this application are hereby expressly incorporated by 
15 reference. 

EXAMPLES 

Example 1 - Generation of Full Length CFTR cDNAs 

20 Nearly all of the conmionly used DNA cloning vectors are based on plasmids 

containing modified pMBl replication origins and are present at up to 500 to 700 copies per 
cell (Sambrook et al. Molecular Cloning: A Laboratory Manual (Cold Spring Harbor 
Laboratory Press 1989). The partial CFTR cDNA clones isolated by Riordan et al. were 
maintained in such a plasmid. It was postulated that an alternative theory to intrinsic clone 

25 instability to explain the apparent inability to recover clones encoding full length CFTR 
protein using high copy number plasmids, was that it was not possible to clone large 
segments of the CFTR cDNA at high gene dosage in E. coll Expression of the CFTR or 
portions of the CFTR from regulatory sequences capable of directing transcription and/or 
translation in the bacterial host cell might resuk in inviability of the host cell due to toxicity 

30 of the transcript or of the full length CFTR protein or fragments thereof This inadvertent 
gene expression could occur from either plasmid regulatory sequences or cryptic regulatory 
sequences within the recombinant CFTR plasmid which are capable of functioning in E, coli. 
Toxic expression of the CFTR coding sequences would be greatly compounded if a large 
number of copies of the CFTR cDNA were present in cells because a high copy number 

35 plasmid was used. If the product was indeed toxic as postulated, the growth of cells 

containing full length and correct sequence would be actively disfavored. Based upon this 
novel hypothesis, the following procedures were undertaken. With reference to Figure 2, 
partial CFTR clone Tl 6-4.5 was cleaved with restriction enzymes Sph 1 and Pst 1 and the 
resulting 3.9 kb restriction fragment containing exons 1 1 through most of exon 24 (including 
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an uncharacterized 1 19 bp insertion reported by Riordan et al. between nucleotides 1716 and 
1 717), was isolated by agarose gel purification and ligated between the Sph 1 and Pst 1 sites 
of the pMBl based vector pkk223-3 (Brosius and Holy, (1984) Proc. Natl Acad Set 
£1:6929). It was hoped that the pMBl origin contained within this plasmid would allow it 
5 and plasmids constructed fi*om it to replicate at 1 5-20 copies per host E, coli cell (Sambrook 
et al. Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press 
1989). The resultant plasmid clone was called pkk«4.S, 

Partial CFTR clone Til was cleaved with ECQ El and HintJI and the 1.9 kb band 
encoding the first 1786 nucleotides of the CFTR cDNA plus an additional 100 bp of DNA at 

10 the 5* end was isolated by agarose gel purification. This restriction firagment was inserted 
between the ESQ Rl site and Sma 1 restriction site of the plamid Bluescript Sk- (Stratagene, 
catalogue number 212206), such that the CFTR sequences were now flanked on the upstream 
(5') side by a 1 site from the cloning vector. This clone, designated Tl 1-R, was cleaved 
with Sal 1 and S]2h 1 and the resultant 1.8 kb band isolated by agarose gel purification. 

1 5 Plasmid pkk-4.5 was cleaved with Sal 1 and ^[2h 1 and the large firagment was isolated by 
agarose gel purification. The purified Tl 1-R fragment and pkk-4.5 fragments were ligated to 
constmct pkk-CFTRl . pkk-CFTRl contains exons 1 through 24 of the CFTR cDNA. It was 
discovered that this plasmid is stably maintained m E. coli cells and confers no measureably 
disadvantageous growth characteristics upon host cells. 

20 pkk-CFTRl contains, between nucleotides 1 7 1 6 and 1 7 1 7, the 1 1 9 bp insert DNA 

derived from partial cDNA clone Tl 6-4.5 described above. In addition, subsequent sequence 
analysis of pkk-CFTRl revealed unreported differences in the coding sequence between that 
portion of CFTRl derived from partial cDNA clone Tl 1 and the published CFTR cDNA 
sequence. These undesired differences included a 1 base-pair deletion at position 995 and a 

25 C to T transition at position 1 507. 

To complete construction of an intact correct CFTR coding sequence without 
mutations or insertions and with reference to the construction scheme shown in Figure 3, 
pkk-CFTRl was cleaved with Xba 1 and Hpa 1, and dephosphorylated with calf intestinal 
alkalme phosphatase. In addition, to reduce the likelihood of recovering the original clone, 
, 30 the small unwanted Xba I /Hpa I restriction fragment from pKK-CFTRl was digested with 

Sph I . Tl 6- 1 was cleaved with ICbi I and Acfi 1 and the 1 . 1 5 kb fragment isolated by agarose 
gel purification. Tl 6-4.5 was cleaved with Aee 1 and Hpa I and the 0.65 kb band was also 
isolated by agarose gel purification. The two agarose gel purified restriction fiagments and 
the dephosphorylated pKK-CFTRl were ligated to produce pKK-CFTR2. Alternatively, 

35 pKK-CFTR2 could have been constructed using corresponding restriction fragments from the 
partial CFTR cDNA clone CI -1/5. pKK-CFTR2 contains the uninterrupted CFTR protein 
coding sequence and conferred slow growth upon E. coli host cells in which it was inserted, 
whereas pKK-CFTRl did not. The origin of replication of pKK-CFTR2 is derived from 
pMBl and confers a plasmid copy number of 15-20 copies per host cell. 
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Example 2 - Improving Host Cell Viability 

An additional enhancement of host cell viability was accomplished by a further 
reduction in the copy number of CFTR cDNA per host cell. This was achieved by 
5 transferring the CFTR cDNA into the plasmid vector, pSC-3Z. pSC-3Z was constructed 
using the pSClOl replication origin of the low copy number plasmid pLG338 (Stoker et al .. 
Gene 1&, 335 (1982)) and the ampicillin resistance gene and polylinker of pGEM-3Z 
(available from Promega). pLG338 was cleaved with Sjzh I and £xu Q and the 2.8 kb 
fragment containing the replication origin isolated by agarose gel purification. pGEM-3Z 

1 0 was cleaved with Alw the resultant restriction fragment ends treated with T4 DN A 
polymerase and deoxynucleptide triphosphates, cleaved with Sph I and the 1 .9 kb band 
containing the ampicillin resistance gene and the polylinker was isolated by agarose gel 
purification. The pLG338 and pGEM-3Z fragments were ligated together to produce the low 
. copy number cloning vector pSC-3Z. pSC-3Z and other plasmids containing pSClOl origins 

15 of replication are mamtamed at approximately five copies per cell (Sambrook gt supra). 
With additional reference to Figure 4, pKK-CFTR2 was cleaved with ECQ EY, Est I 
and Sal i and then passed over a Sephacryl S400 spun column (available from Pharmacia) 
according to the manufacturer's procedure in order to remove the Sal I to Eco EV restriction 
fiagment which was retained within the column. pSC-3Z was digested with Sma 1 and Est 1 

20 and also passed over a Sephacryl S400 spun colxram to remove the small Sma I/Est I 

restriction fragment which was retained within the column. The column eluted fractions from 
the pKK-CFTR2 digest and the pSC-3Z digest were mixed and ligated to produce pSC- 
CFTR2. A map ofthis plasmid is presented in Figure 5. Host cells containing CFTR cDN As 
at this and similar gene dosages grow well and have stably maintained the recombinant 

25 plasmid with the full length CFTR coding sequence. In addition, this plasmid contains a 
bacteriophage T7 RNA polymerase promoter adjacent to the CFTR coding sequence and is 
therefore convenient for in vitro transcription/translation of the CFTR protein. The 
nucleotide sequence of CFTR coding region from pSC-CFTR2 plasmid is presented in 
Sequence Listing 1 as SEQ ID NO: 1 . Significantly, this sequence differs from the previously 

30 published (Riordan, J.R. et al. (1989) Science 245:1066-1073) CFTR sequence at position 
1990, where there is C in place of the reported A. See Gregory, R.J. et al. (1990) Nature 
347:382-386. E. coli host cells containing pSC-CFTR2, internally identified with the number 
pSC-CFTR2/AGl, have been deposited at the American Type Culture Collection and given 
the accession number: ATCC 68244. 

35 

Example 3 ■ Alternate Method for Improving Host Cell Viability 

A second method for enhancing host cell viability comprises disruption of the CFTR 
protein coding sequence. For this purpose, a synthetic intron was designed for insertion 
between nucleotides 1716 and 1717 of the CFTR cDNA. This intron is especially 
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advantageous because of its easily manageable size. Furthermore, it is designed to be 
efiBciently spliced from CFTR primary RNA transcripts when expressed in eukaryotic cells. 
Four synthetic oligonucleotides were synthesized (1 195RG, 1 196RG, 1 197RG and 1 198RG) 
collectively extending from the Sell 1 cleavage site at position 1 700 to the Hinc II cleavage 
5 site at position 1785 and including the additional 83 nucleotides between 1716 and 1717 (see 
Figure 6). These oligonucleotides were phosphorylated with T4 polynucleotide kinase as 
described by Sambrook et al., mixed together, heated to 95°C for 5 minutes in the same 
buffer used during phosphorylation, and allowed to cool to room temperature over several 
hours to allow annealing of the single stranded oligonucleotides. To insert the synthetic 

1 0 intron into the CFTR coding sequence and with reference to Figures 7A and 7B, a subclone 
of plasmid Tl 1 was made by cleaving the M I site in the polylmker, repairing the recessed 
ends of the cleaved DNA with deoxynucleotide triphosphates and the large fragment of DNA 
Polymerase I and religating the DNA. This plasmid was then digested with £cq B^L and 
I and religated. The resulting plasmid T16-A5' extended from the Mm I site at position 490 of 

1 5 the CFTR cDNA to the 3' end of clone T16 and contained smgle sites for Sit I and Hinc II at 
positions coiresponding to nucleotides 1700 and 1785 of the CFTR cDNA. T16-A5' plasmid 
was cleaved with Sst I and Hinc II and the large fragment was isolated by agarose gel 
purification. The annealed synthetic oligonucleotides were ligated into this vector fragment 
to generate T16-intron. 

20 Tl 6-intron was then digested with Eoa £1 and Sma I and the large fragment was 

isolated by agarose gel purification. Tl 6-4.5 was digested with Ec2 El and Sea I and the 790 
bp fragment was also isolated by agarose gel purification. The purified Tl 6-intron and T16- 
4.5 fragments were ligated to produce T16-intron-2. T16-intron-2 contains CFTR cDNA 
sequences extending from the Nm I site at position 490 to the Sca I site at position 281 8, and 

25 inclxides the imique Hpa I site at position 2463 which is not present in Tl 6-1 or Tl 6-intron-l . 

T-16-intron-2 was then cleaved with 2Cha 1 and Hpa I and the 1 800 bp fragment was 
isolated by agarose gel purification. pKK-CFTRl was digested with 2Q2a I and Hpa I and the 
large fragment was also isolated by agarose gel purification and ligated with the fragment 
derived from T16-intron-2 to yield pKK-CFTR3, shown in Figure 8. The CFTR cDNA 

30 within pKK-CFTR3 is identical to that within pSC-CFTR2 and pKK-CFTR2 except for the 
insertion of the 83 bp mtron between nucleotides 1716 and 1717. The msertion of this intron 
resulted in improved growth characteristics for cells harboring pKK-CFTR3 relative to cells 
containing the unmodified CFTR cDNA in pKK-CFTR2. 

35 Example 4 - In vitro Transcription/Translation 

In addition to sequence analysis, the integrity of the CFTR cDNA open reading frame 
was verified by in vitro transcription/translation. This method also provided the initial CFTR 
protein for identification purposes. 5 micrograms of pSC-CFTR2 plasmid DNA were 
Imearized with Sal I and used to direct the synthesis of CFTR RNA transcripts with T7 RNA 
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polymerase as described by the 5iq)plier (Stratagene). This transcript was extracted with 
phenol and chloroform and precipitated with ethanoL The transcript was resuspended in 25 
microliters of water and varying amounts were added to a reticulocyte lysate in vitro 
translation system (Promega). The reactions were performed as described by the supplier in 
. 5 the presence of canine pancreatic microsomal membranes (Promega), using 35s-methionine 
to label newly synthesized proteins. In vitro translation products were analysed by 
discontinuous polyacrylamide gel electrophoresis in the presence of 0.1% SDS with 8% 
separating gels (Laemmii, U.K. (1970) Nature 227:680-685). Before electrophoresis, the in 
vitro translation reactions were denatured with 3% SDS, 8 M urea and 5% 2-mercaptoethanol 
10 in 0.65 M Tris-HCl, pH 6.8. Following electrophoresis, the gels were fixed in 

methanohacetic acid:water (30:10:60), rinsed with water and impregnated with 1 M sodium 
salicylate. 35s labelled proteins were detected by fluorgr^hy. A band of approximately 180 
kD was detected, consistent with translation of the full length CFTR insert. 

15 

Example 5 - Elimination of Cryptic Regulatory Signals 

Analysis of the DNA sequence of the CFTR has reyealed the presence of a potential 
E, coll RNA polymerase promoter between nucleotides 748 and 778 which conforms well to 
the derived consensus sequence for E. coli promoters (Reznikoff and McClure, Maximizing 

20 Gene Expression, 1, Butterworth Publishers, Stoneham, MA). If this sequence functions as a 
promoter functions in E. coli, it could direct synthesis of potentially toxic partial CFTR 
polypeptides. Thus, an additional advantageous procedure for maintaining plasmids 
containing CFTR cDNAs in Ecoli would be to alter the sequence of this potential promoter 
such that it will not function in E. coli. This may be accomplished without altering the amino 

25 acid sequence encoded by the CFTR cDNA. Specifically, plasmids containing complete or 
partial CFTR cDNA's would be altered by site-directed mutagenesis using synthetic 
olignucleotides (ZoUer and Smitii, (1983) Methods EnzymoL iQQ:468 ). More specifically, 
altering the nucleotide sequence at position 908 fi'om a T to C and at position 774 from an A 
to a G effectively eliminates the activity of this promoter sequence without altering the amino 
, 30 acid coding potential ofthe CFTR open reading frame. Other potential regulatory signals 

within the CFTR cDN A for transcription and translation could also be advantageously altered 
and/or deleted by the same method. 

Futher analysis has identified a sequence extending from nucleotide 908 to 936 which 
functions efficiently as a transcriptional promoter element in coli (Gregory, R.J. et al. 

35 (1 990) Nature 347:382-386). Mutation at position 936 is capable of inactivating this 
promoter and allowing the CFTR cDNA to be stably maintained as a plasmid in E. coli 
(Cheng, S.H. et al (1990) Cell 63:827-834). Specifically position 936 has been altered from 
a C to a T residue yvithout the? amino acid sequence encoded by the cDNA being altered. 
Other mutations within this regulatory element described in Gregory, RJ. et al. (1 990) 
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Nature 347:382-386 could also be used to inactivate the transcriptional promoter activity. 
Specifically, the sequence from 908 to 913 (TTGTGA) and from 931 to 936 (GAAAAT) 
could be altered by site directed mutagenesis without altering the amino acid sequence 
encoded by the cDNA. 

5 

Example 6 - Cloning of CFTR in Alternate Host Systems 

Although the CFTR cDNA displays apparent toxicity in E. coli cells, other types of 
host cells may not be affected in this way. Alternative host systems in vAnch the entire 
CFTR cDNA protein encoding region may be maintained and/or expressed include other 
1 0 bacterial species and yeast It is not possible a priori to predict which cells might be resistant 
and which might not Screening a number of different host/vector combinations is necessary 
to find a suitable host tolerant of expression of the fiill length protein or potentially toxic 
fragments thereof. 



15 

Example 7 - Generation of Aden ovirus Vector Encoding CFTR f Ad2/CFTR1 
1 , DNA preparation - Construction of the recombinant Ad2/CFTR-1 virus (the sequence 
of which is shown in Table n and as SEQ ID N0:3) was accomplished as follows: The 
CFTR cDNA was excised from the plasmid pCNrV-CFTR-936C using restriction enemies 

20 Spel and EcII361 . pCMV-CFTR-936C consists of a minimal CFTR cDNA encompassing 
nucleotides 123-4622 of the published CFTR sequence cloned into the multiple cloning site 
of pRC/CMV (Invitrogen Corp.) using synthetic linkers. The CFTR cDNA within this 
plasmid has been completely sequenced. The Spsl/Ecinfil restriction fragment contains 47 
bp of 5' sequence derived fi^m synthetic linkers and the multiple cloning site of the vector. 

25 The CFTR cDNA (the sequence of which is shown as SEQ ID NO: 1 and the amino 

acid sequence encoded by the CFTR cDNA is shown as SEQ ID N0:2) was inserted between 
the Nhel and SnaBl restriction sites of the adenovirus gene transfer vector pBR-Ad2-7. pBR- 
Ad2-7 is a pBR322 based plasmid containing an approximately 7 kb insert derived from the 
5' 10680 bp of Ad2 inserted between the CM and BamHl sites of pBR322. From this Ad2 
. 30 fragment, the sequences corresponding to Ad2 nucleotides 546-3497 were deleted and 

replaced with a 12 bp multiple cloning site containing an Nhel site, an MM site, and a SnaBl 
site. The construct also contains the 5' inverted terminal repeat and viral packaging signals, 
the Ela enhancer and promoter, the Elb 3* intron and the 3' untranslated region and 
polyadenylation sites. The resulting plasmid was called pBR-Ad2-7/CFTR. Its use to 

35 assemble virus is described below. 



2. Virus Preparation from DNA - To generate the recombinant Ad2/CFTR-1 adenovirus, 
the vector pBR-Ad2-7/CFTR was cleaved with BstBl at the site corresponding to the unique 
BstBl site at 10670 in Ad2. The cleaved plamid DNA was ligated to BstBl restricted Ad2 
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DNA. Following ligation, the reaction was used to transfect 293 cells by the calcium 
phosphate procedure. Approximately 7-8 days following transfection, a single plaque 
appeared and was used to reinfect a dish of 293 cells. Following development of cytopathic 
effect (CPE), the medium was removed and saved. Total DNA was prepared from the 
5 infected cells and analyzed by restriction analysis with multiple enzymes to verily the 
integrity of the construct. Viral supernatant vm then used to infect 293 cells and upon 
development of CPE, expression of CFTR was assayed by the protein kinase A (PKA) 
immunoprecipitation assay (Gregory, R.J. et al. (1990) Nature 347:382 ). Following these 
verification procedures, the virus was fiirther purified by two rounds of plaque purification. 
1 0 Plaque purified virus was grown into a small seed stock by inoculation at low 

multiplicities of infection onto 293 cells grown in monolayers in 925 medium supplemented 
with 10% bovine calf serum. Material at this stage was designated a Research Viral Seed 
Stock (RVSS) and was used in all preliminary experiments. 

15 3. Vims Host Cell - Ad2/CFTR-1 is propagated in human 293 cells (ATCC CRL 1573). 
These cells are a human embryonal kidney cell line which were immortalized with sheared 
fragments of human Ad5 DNA. The 293 cell line expresses adenovirus early region 1 gene 
products and in consequence, will support the growth of El deficient adenoviruses. By 
analogy with retroviruses, 293 cells could be considered a packaging cell line, but they differ 

20 horn usual retrovirus lines in that they do not provide missii^ viral structural proteins, rather, 
they provide only some missing viral early functions. 

Production lots of virus are propagated in 293 cells derived from the Workmg Cell 
Bank (WCB). The WCB is in turn derived from the Master Cell Bank (MCB) which was 
grown up from a fresh vial of cells obtained from ATCC. Because 293 cells are of human 

25 origin, they are being tested extensively for the presence of biological agents. The MCB and 
WCB are being characterized for identity and the absence of adventitious agents by 
Microbiological Associates, Rockville, MD. 

4. Growth of Pro Hnrtinn T.nts of Virus 
. 30 Production lots of Ad2/CFTR-1 are produced by inoculation of approximately 5-10 x 

10^ pfii of MVSS onto approximately 1-2 x 10^ Web 293 cells grown in a T175 flask 
containing 25 mis of 925 medium. Inoculation is achieved by direct addition of the virus 
(approximately 2-5 mis) to each flask. Batches of 50-60 flasks constitute a lot. 

FoUowmg 40-48 hours incubation at 37°C, the cells are shaken loose from the flask 
35 and transferred with medium to a 250 ml centrifuge bottle and spun at 1 000 xg. The cell 
pellet is resuspended in 4 ml phosphate buffered saline containing 0.1 g/1 CaCl2 and O.lg/1 
MgCl2 and the cells subjected to cycles of freeze-thaw to release virus. Cellular debris is 
removed by centrifugation at 1000 xg for 15 min. The supernatant firam this centrifiigation is 
layered on top of the CsCl step gradient: 2 ml 1 .4g/nJ CsCl and 3 ml 1 .25g/ml CsCl in 1 0 
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mM Tris, 1 mM EDTA (TE) and spun for 1 hour at 35,000 rpm in a Beckman SW41 rotor. 
Virus is then removed from the interface between the two CsCl layers, mixed with 1 .35 g/ml 
CsCl in TE and then subjected to a 2.5 hour equilibriuni centrifugation at 75,000 rpm in a 
TLN-100 rotor. Virus is removed by puncturing the side of the tube with a hypodermic 
5 needle and gently removing the banded virus. To reduce the CsCl concentration, the sample 
is dialyzed against 2 changes of 2 liters of phosphate buffered saline with 1 0% sucrose. 

Following this procedure, dialyzed virus is stable at 4X for several weeks or can be 
stored for longer periods at -SO'^C. Aliquots of material for human use will be tested and 
while awaiting the results of these tests, the remamder will be stored frozen. The tests to be 
10 performed are described below: 

5. Structure and Purity of Virus 

SDS polyacrylamide gel electrophoresis of purified virions reveals a number of 
polypeptides, many of which have been characterized. When preparations of virus were 
15 subjected to one or two additional rounds of CsCl centrifugation, the protein profile obtained 
was mdistinguishable. This indicates that additional equilibrium centrifugation does not 
purify the virus further, and may suggest that even the less mtense bands detected in the virus 
preparations represent minor virion components rather than contaminating proteins. The 
identity of the protein bands is presently being established by N-termmal sequence analysis. 

20 

6. rnntflminatiTig Materials - The material to be administered to patients will be 2 x 1 o6 
pfu, 2 X 107 pfu and 5 X 107 pfu of purified Ad2/CFTR-1 . Assuming a minimum particle to 
pfu ratio of 500, this corresponds to 1 x lO^, 1 x 10^0 and 2.5 x lO^O viral particles, these 
correspond to a dose by mass of 0.25 jig, 2.5jig and 6,25 fig assuming a moieuclar mass for 

25 adenovirus of 1 50 x 1 O^. 

The origin of the materials from which a production lot of the purified Ad2/CFTR-1 
is derived was described in detail above and is illustrated as a flow diagram in Figure 6. All 
the starting materials from which the purified virus is made (i.e., MCB, and WCB, and the 
MVSS) will be extensively tested. Further, the growth medium used will be tested and the 

30 serum will be from only approved suppliers who will provide test certificates. In this way, all 
the components used to generate a production lot will have been characterized. Following 
growth, the production lot virus will be purified by two roxmds of CsCl centrifugation, 
dialyzed, and tested. A production lot should constitute 1-5 x 10l0pfuAd2/CFTR-L 

As described above, to detect any contaminating material aliquots of the production 

35 lot will be analyzed by SDS gel electrophoresis and restriction enzyme mapping. However, 
these tests have limited sensitivity. Indeed, unlike the situation for purified single chain 
recombinant proteins, it is very difficult to quantitate the purity of the AD2/CFTR-1 using 
SDS polyacrylamide gel electrophoresis (or similar methods). An alternative is the 
immunological detection of contaminating proteins (IDCP). Such an assay utilizes antibodies 
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raised against the proteins purified in a mock purification run. Development of such an assay 
has not yet been attempted for the CsCl purification scheme for Ad2/CFTR-1 . However, 
initially an IDCP assay developed for the detection of contaminants in recombmant protems 
produced in Chinese hamster ovary (CHO) cells will be used. In addition, to hamster 
5 proteins, these assays detect bovine serum albumin (BSA), transferrin and IgG heavy and 
light chain derived from the serum added to the growth medium. Tests using such reagents to 
examine research batches of Ad2/CFTR-1 by both ELISA and Westem blots are in progress. 

Other proteins contaminating the virus preparation are likely to be from the 293 cells - 
that is, of human origin. Human proteins contaminating therapeutic agents derived from 

1 0 human sources are usually not problematic. In this case, however, we plan to test the 
production lot for transforming factors. Such factors could be activities of contammating 
human proteins or of the Ad2/CFTR-1 vector or other contaminating agents. For the test, it is 
proposed that 10 dishes of Rat 1 cells containing 2 x 10^ cells (the number of target cells in 
the patient) with 4 times the highest human dose of Ad2/CFTR-1 (2 x 10^ pfu) will be 

1 5 infected. Following infection, the cells will be plated out in agar and examined for the 

appearance of transformed foci for 2 weeks. Wild type adenovirus will be used as a control. 

Nucleic acids and proteins would be expected to be separated from purified virus 
preparations upon equilibrium density centrifiigation. Furthermore, the 293 cells are not 
expected to contam VL30 sequences. Biologically active nucleic cells should be detected. 

20 

Example 8 - Prelhnin arv Experiments Testing the Abilitv of Ad2/BGal or Ad2/CFTR Virus 
tQ Entgr Airway Epithdial Cells 

a. Hamgtgr Stydigs 

25 Initial studies mvolving the intratracheal instillation of the Ad-pGal viral vector into 

Syrian hamsters, which are reported to be permissive for human adenovirus are being 
performed. The first study, a time course assessment of the pulmonary and systemic acute 
inflammatory response to a single intratracheal administration of Ad-PGal viral vector, has 
been completed. In this study, a total of 24 animals distributed among three treatment 

30 groups, specifically, 8 vehicle control, 8 low dose virus (1 x 10^ ^ particles; 3 x 10^ pfii), and 
8 high dose virus (1.7 x 10^2 particles; 5 x 10^ pfu), were used. Within each treatment 
group, 2 animals were analyzed at each of four time points after viral vector instillation: 6 
hrs, 24 hrs, 48 hrs, and 7 days. At the time of sacrifice of each animal, lung lavage and blood 
samples were taken for analysis. The lungs were fixed and processed for normal light-level 

35 histology. Blood and lavage fluid were evaluated for total leukocyte count and leukocyte 
differential. As an additional measure of the inflammatory process, lavage fluid was also 
evaluated for total protein. Following embeddings, sectioning and hematoxylin/eosin 
staining, lung sections were evaluated for signs of inflammation and airway epithelial 
damage. 
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Wiih the small sample size, the data from this preliminary study were not amenable to 
statistical analyses, however, some general trends could be ascertained. In the peripheral 
blood samples, total leukocyte counts showed no apparent dose- or time- dependent changes. 
In the blood leukocyte differential counts, there may have been a minor dose-related 
5 elevation in percent neutrophil at 6 hours; however, data from all other time points showed no 
elevation in neutrophil percentages. Taken together, these data suggest little or nor systemic 
inflanunatory response to the viral administration. 

From the lung lavage, some elevation in total neutrophil comts were observed at tiie 
first three tune points (6 hr, 24 hr, 48 hr). By seven days, both total and percent neutrophil 
1 0 values had returned to normal range. The trends in lung lavage protein levels were more 
difficult to assess due to inter-animal variability; however, no obvious dose- or time- 
dependent effects were apparent First, no damage to airway epithelium was observed at any 
time point or virus dose level Second, a time- and dose- dependent mild inflammatory 
response was observed, bemg maximal at 48 hr in the high vuus dose animals. By seven 
1 5 days, the inflammatory response had completely resolved, such that the lungs from animals in 
all treatment groups were indistinguishable. 

In siumnary, a mild, transient, pulmonary inflammatory response appears to be 
associated with the intratracheal administration of the described doses of adenoviral vector in 
the Syrian Hamster. 

20 A second, single intratracheal dose, hamster study has been initiated. This study is 

designed to assess the possibility of the spread of ineffective viral vectors to organs outside of 
the lung and the antibody response of the animals to the adenoviral vector. In this study, the 
three treatment groups (vehicle control, low dose virus, high dose virus) each contained 12 
animals. Animals will be evaluated at three time pomts: 1 day, 7 days, and 1 month. In this 

25 study, viral vector persistence and possible spread will be evaluated by the assessment of the 
presence of infective virions in numerous organs including lung, gut, heart, liver, spleen, 
kidney, brain and gonads. Changes in adenoviral antibody titer will be measured in 
peripheral blood and lung lavage. Additionally, lung lavage, peripheral blood and limg 
histology will be evaluated as in the previous study. 

30 

b. Primate studies. 

Studies of recombinant adenovirus are also underway in primates. The goal of these 
studies is to assess the ability of recombinant adenoviral vectors to deliver genes to the 
respiratory epithelium in vivo and to assess the safety of the construct in primates. Initial 
35 studies in primates targeted nasal epithelia as the site of infection because of its similarity to 
lower airway epithelia, because of its accessibility, and because nasal epithelia was used for 
the first human studies. The Rhesus monkey (Macaca mulatto) has been chosen for studies, 
because it has a nasal epithelium similar to that of humans. 
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How expression of CFTR affects the electrolyte transport properties of the nasal 
epithelium can be studied in patients with cystic fibrosis. But because the primates have 
normal CFTR function, instead the ability to transfer a reporter gene was assessed. Therefore 
the Ad-pGal virus was used. The epithelial cell density in the nasal cavity of the Rhesus 
5 monkey is estimated to be 2 x 10^ cells/cm (based on an average nasal epithelial cell 

diameter of 7 jim) and the surface near 25-50 cm^. Thus, there are about 5 x 10^ cells in the 
nasal epithelium of Rhesus monkey. To focus especially on safety, the higher viral doses 
(20-200 MOI) were used in vivo. Thus doses in the range of lO^-lO*® pfo were used. 

In the first pilot study the right nostril of Monkey A was infected with Ad-p-Gal (-1 

1 0 ml). This viral preparation was purified by CsCl gradient centrifugation and then by gel 
filtration chromatography one week later. Adenoviruses are typically stable in CsCl at 4°C 
for one to two weeks. However, this viral preparation was found to be defective (i.e., it did 
not produce detectable p-galactosidase activity in tiie permissive 293 cells). Thus, it was 
concluded that there was no live viral activity in the material, p-galactosidase activity in 

1 5 nasal epithelial cells Scorn Monkey A was also not detected. Therefore, in the next study, two 
different preparations of Ad-p-Gal virus: one diat was purified on a CsCl gradient and then 
dialyzed against Tris-buffeied saline to remove the CsCl, and a crude impurified one was 
used. Titers of Ad-P-Gal viruses were -2 x lOlO pfii/ml and > 1 x 10l3 pfu/ml, respectively, 
and both preparations produced detectable p-galactosidase activity in 293 cells. 

20 Monkeys were anesthetized by intramusciilar injection of ketamine ( 1 5 mg/kg). One 

week before administration of virus, the nasal mucosa of each monkey was brushed to 
establish baseline cell differentials and levels of P-galactosidase. Blood was drawn for 
baseline determination of cell differentials, blood chemistries, adenovirus antibody titers, and 
viral cultures. Each monkey was also examined for weight, temperature, appetite, and 

25 general health prior to infection. 

The entire epithelium of one nasal cavity was used in each monkey. A foley catheter 
(size 10) was inserted through each nasal cavity into the pharynx, inflated with 2-3 ml of air, 
and then pulled anteriorly to obtain tight posterior occlusion at the posterior choana. Both 
nasal cavities were then irrigated with a solution (-5 ml) of 5 mM dithiothreitol plus 0.2 U/ml 

30 neuraminidase in phosphate-buffered saline (PBS) for five minutes. This solution was used 
to dissolve any residual mucus overlaying the epithelia. (It was subsequently found that such 
treatment is not required.) The washing procedure also allowed the determination of whether 
the balloons were effectively isolating the nasal cavity. The vhiis (Ad~P-Gal) was then 
slowly instilled into the right nostril with the posterior balloon inflated. The viral solution 

35 remained in contact with the nasal mucosa for 30 minutes. At the end of 30 minutes, the 
remaining viral solution was removed by suction. The balloons were deflated, the catheters 
removed, and the monkey allowed to recover from anesthesia. Monkey A received the CsCl- 
purified virus (-1.5 ml) and Monkey B received the crude virus (-6 ml), (note that this was 
the second exposure of Monkey A to the recombinant adenovirus). 
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Both monkeys were followed daily for appearance of the nasal mucosa, conjunctivitis, 
appetite, activity, and stool consistency. Each monkey was subsequently anesthetized on 
days 1, 4, 7, 14, and 21 to obtain nasal, pharyngeal, and tracheal cell samples (either by 
swabs or brushes) as described below. Phlebotomy was performed over the same time course 
5 for hematology, ESR, general screen, antibody serology and vbal cultures. Stools were 
collected every week to assess viral cultures. 

To obtain nasal epithelial cells from an anesthetized monkey, the nasal mucosa was 
first impregnated with 5 drops of Afrih (0.05% oxymetazoline hydrochloride, Schering- 
Plough) and 1 ml of 2% Lidocaine for 5 min. A cytobrush (the kind typically used for Pap 
1 0 smears) was then used to gently mb the mucosa for about 1 0 seconds. For tracheal bnishings, 
a flexible fiberoptic bronchoscope; a 3 mm cytology brush (Bard) was advanced through the 
bronchoscope into the trachea, and a small area was brushed for about 1 0 seconds. This 
procedure was repeated twice to obtain a total of -lO^ cells/ml. Cells were then collected on 
slides (approximately 2 x 10^ cells/slide usmg a Cytospin 3 (Shandon, PA)) for subsequent 
1 5 staining (see below). 

To determine viral efScacy, nasal, pharyngeal, and tracheal cells were stained for p- 
galactosidase using X-gal (5 bromo-4-chloro-3-indolyl-p-D-galactoside). Cleavage of X-gal 
by P-galactosidase produces a blue color that can be seen with light microscopy. The Ad-p- 
gal vector mcluded a nuclear-localization signal (MLS) (from SV40 large T-antigen) at the 
20 amino-terminus of the p-galactosidase sequence to direct expression of this protein to the 
nucleus. Thus, the number of blue nuclei after staining was determined. 

RT-PCR (reverse transcriptase-polymerase chain reaction) was also used to determine 
viral efScacy. This assay indicates the presence of P-galactosidase mRNA in cells obtained 
by bnishings or swabs. PGR primers were used in both the adenovuiis sequence and the 
25 LacZ sequence to distinguish virally-produced mRNA from endogenous mRNA. PCR was 
also used to detect the presence of the recombinant adenovuus DNA. Cytospin preparations 
was used to assess for the presence of virally produced p-galactosidase mRNA in the 
respiratory epithelial cells using in-situ hybridization. This technique has the advantage of 
being highly specific and will allow assessment which cells are producing the mRNA. 
• 30 Whether there was any inflammatory response was assessed by visual inspection of 

the nasal epithelium and by cytological examination of Wright-stained cells (cytospin). The 
percentage of neutrophils and lymphocytes were compared to that of the control nostril and to 
the normal values fix)m four control monkeys. Systemic repsonses by white blood cell 
counts, sedimentation rate, and fever were also assessed. 
35 Viral replication at each of the time points was assessed by testing for the presence of 

live virus in the supernatant of the cell suspension from swabs or brushes. Each supernatant 
was used to infect (at several dilutions) the virus-sensitive 293 cell line, Cytopathic changes 
in the 293 cells were monitored for 1 week and then the cells were fixed and stained for p- 
galactosidase. Cytopathic effects and blue-stained cells indicated the presence of live virus. 
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Positive supematants will also be subjected to analysis of nonintegrating DNA to identify 
(confirm) the contributing virus(es). 

Antibody titers to type 2 adenovirus and to the recombinant adenovirus were 
determined by ELISA. Blood/serum analysis was performed using an automated chemistry 
5 analyzer Hitachi 737 and an automated hematology analyzer Technicom H6, The blood 
buffy coat was cultured in AS49 cells for wild type adenovirus and was cultured in the 
permissive 293 cefls. 

Results: Both monkeys tolerated the procedure well. Daily examination revealed no 
evidence of coryza, conjunctivitis or diarrhea. For both monkeys, the nasal mucosa was 
10 mildly erythematous in both the infection side and the control side; this was interpreted as 
being due to the instrumentation. Appetites and weights were not affected by virus 
administrated in either monkey. Physical examination on days 1, 4,7, 14 and 21 revealed no 
evidence of lymphadenopathy, tachypnea, or tachycardia. On day 21, monkey B had a 
temperature 39 J°C (normal for Rhesus monkey 38.8°C) but had no other abnormalities on 
15 physical exam or in laboratory data. Monkey A had a slight leukocytosis on day 1 post 
infection which returned to normal by day 4; the WBC was 4,920 on the day of mfection, 
8,070 on day 1, and 5,200 on day 4. The ESR did not change after the mfection. Electrolytes 
and transaminases were normal throughout. 

Wright stains of cells from nasal brushing were performed on days 4, 7, 14, and 21, 
20 They revealed less than 5% neutrophils and lymphocytes. There was no difference between 
the infected and the control side. 

X"Gal stains of the pharyngeal swabs revealed blue-stained cells in both monkeys on 
days 4, 7, and 14; only a few of the cells had clear nuclear localization of the pigment and 
some pigment was seen in extracellular debris. On day 7 post infection, X-Gal stains from 
25 the right nostril of monkey A, revealed a total of 135 ciliated cells with nuclear-localized blue 
stain. The control side had only 4 blue cells Monkey B had 2 blue cells from the infected 
nostril and none from the control side. Blue cells were not seen on day 7, 14, or 21 . 

RT-PCR on day 3 post infection revealed a band of the correct size that hybridized 
with a p-Gal probe, consistent with P-Gal mRNA in the samples from Monkey A control 
. 30 nostril and Monkey B infected nostril. On day 7 there was a positive band in the sample from 
the mfected nostril of Monkey A, the same specimen that revealed blue cells. 

Fluid from each nostril, the pharynx, and trachea of both monkeys was placed on 293 
cells to check for the presence of live virus by cytopathic effect and X-Gal stain. In Monkey 
A, live virus was detected in both nostrils on day 3 after infection; no live virus was detected 
35 at either one or two weeks post-infection. In Monkey B, live virus was detected in both 
nostrils, pharynx, and trachea on day 3, and only in the infected nostril on day 7 after 
infection. No live virus was detected 2 weeks after the infection. 
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c. Ybmm ExpHmt gtwdiies 

In a second type of experiment, epithelial cells from a nasal polyp of a CF patient 
were cultured on permeable filter supports. These cells form an electrically tight epithelial 
monolayer after several days in culture. Eight days after seeding, the cells were exposed to 
5 the Ad2/CFTR virus for 6 hours. Three days later, the short-circuit current (Isc) across the 
monolayer was measured. cAMP agonists did not increase the Isc, indicating that there was 
no change in chloride secretion. However, this defect was corrected after infection with 
recombinant Adl/CFTR. Cells infected with Ad2/CFTR (M0I=5; MOI refers to multiplicity 
of infection; 1 MOI indicates one pfij/cell) express functional CFTR; cAMP agonists 

10 stimulated Isc, mdicating stimulation of CI" secretion. Ad2/CFTR also corrected the CF 
chloride channel defect in CF tracheal epithelial cells. Additional studies indicated that 
Ad2/CFTR was able to correct the chloride secretory defect without altering the 
transepithelial electrical resistance; this result indicates that the integrity of the epithelial cells 
and the tight junctions was not disrupted by infection with Ad2/CFTR. Application of 1 MOI 

1 5 of Ad2/CFTR was also found to be suflBcient to correct the CF chloride secretory defect. 

The e?q)eriments using primary cultures of human airway epithelial cells indicate that 
the Ad2/CFTR virus is able to enter CF airway epithelial cells and express sufficient CFTR to 
correct the defect in chloride transport. 

20 Example 9 -In Vivo Delivery to and Expresf^inn of CFTR in Cotton Rat and Rhesus Monkey 

Epithelium 

MATERIALS AND METHODS 

Adgnoviros vector 

25 Ad2/CFTR-1 was prepared as described in Example 7. The DNA construct comprises 

a full length copy of the Ad2 genome of approximately 37.5 kb from which the early region 1 
genes (nucleotides 546 to 3497) have been replaced by cDNA for CFTR (nucleotides 123 to 
4622 of the published CFTR sequence with 53 additional linker nucleotides). The yiral Ela 
promoter yvas used for CFTR cDNA. Termination/polyadenylation occurs at the site 

30 normally used by the Elb and protein IX transcripts. The recombinant virus E3 region was 
conserved. The size of the Ad2-CFTR-1 vector is approximately 104.5% that of wild-type 
adenovirus. The recombinant virus was grown in 293 cells that complement the El early 
viral promoters. The cells were frozen and thawed three times to release the virus and the 
preparation was purified on a CsCl gradient, then dialyzed agamst Tris-buffered saline (TBS) 

35 to remove the CsCl, as described. 
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Rats, Twenty two cotton rats (6-8 weeks old, weighing between 80-100 g) were used 
for this study. Rats were anesthetized by inhaled methoxyflurane (Pitman Moore, Inc., 
Mundelen, 111). Virus was applied to the lungs by nasal instillation during inspiration. 
5 Two cotton rat studies were performed. In the first study, seven rats were assigned to 

a one time pulmonary infection with 100 yX solution containing 4.1 x 10^ plaque forming 
units (pfii) of the Ad2/CFTR-1 virus and 3 rats served as controls. One control rat and either 
two or three experimental rats were sacrificed with methoxyflurane and studies at each of 
three time points: 4, 1 1, or 15 days after infection. 

10 The second group of rats was used to test the effect of repeat administration of the 

recombinant virus. All 12 rats received 2.1 x 10^ pfii of the Ad2/CFTR-1 virus on day 0 and 
9 of the rats received a second dose of 3.2 x 10^ pfu of Ad2/CFTR-1 14 days later. Groups 
of one control rat and three experimental rats were sacrificed at 3, 7, or 14 days after the 
second administration of virus. Before necropsy, the trachea was cannulated and 

1 5 brochoaveolar lavage (BAL) was performed with 3 ml aliquots of phosphate-buffered saline. 
A median stemotomy was performed and the right ventricle cannulated for blood collection. 
The right lung and trachea were fixed in 4% formaldehyde and the left lung was fi-ozen in 
liquid nitrogen arid kept at -70°C for evaluation by immunochemistry, reverse transcriptase 
polymerase cham reaction (RT-PCR), and viral culture. Other organs were removed and 

20 quickly firozen in liquid nitrogen for evaluation by polymerase chain reaction (PGR). 

Monkeys. Three female Rhesus monkeys were used for this study; a fourth female 
monkey was kept in the same room, and was used as control. For application of the virus, the 
monkeys were anesthetized by intramuscular injection of ketamine (15 mg/kg). The entire 
epithelixim of one nasal cavity in each monkey was used for virus application. A foley 

25 catheter (size 1 0) was inserted through each nasal cavity into the pharynx, the balloon was 
inflated with 2-3 ml of air, and then pulled anteriorly to obtain a tight occlusion at the 
posterior choana. The Ad2/CFTR-1 virus was then instilled slowly in the right nostril with 
the posterior balloon inflated. The viral solution remained in contact with the nasal mucosa 
for 30 min. The balloons were deflated, the catheters were removed, and the monkeys were 
. 30 allowed to recover firom anesthesia. A similar procedure was performed on the left nostril, 
except that TBS solution was instilled as a control. The monkeys received a total of three 
doses of the virus over a period of 5 months. The total dose given was 2.5 x 109 pfii the first 
time, 2.3 x 10^ pfii the second time, and 2.8 x 10^ pfii the third time. It was estimated that 
the cell density of the nasal epithelia to be 2 x 10^ cells/cm^ and a surface area of 25 to 50 

35 cm2. This corresponds to a multiplicity of infection (MOI) of approximately 25. 

The animals were evaluated 1 week before the first administration of virus, on the day 
of administration, and on days 1, 3, 6, 13, 21, 27, and 42 days after infection. The second 
administration of virus occurred on day 55. The monkeys were evaluated on day 55 and then 
on days 56, 59, 62, 69, 76, 83, 89, 96, 103, and 1 1 1 . For the third administration, on day 134, 
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only the left nostril was cannulated and exposed to the virus. The control monkey received 

instillations of PBS instead of virus. Biopsies of the left medial turbinate were carried out on 

day 135 in one of the infected monkeys, on day 138 on the second infected monkey, and on 

day 142 on the third infected monkey and on the control monkey. 
5 For evaluations, monkeys were anesthetized by intramuscular injection of ketamine 

(1 S mg/kg). To obtain nasal epithelial cells, the nasal mucosa was first impregnated with 5 

drops of Afnn (0.05% oxymetazoline hydrochloride, Schering-Plough) and 1 ml of 2% 

Lidocaine for 5 minutes. A cytobrush was then used to gently rub the mucosa for about 3 

sec. To obtain pharyngeal epithelial swabs, a cotton-tipped applicator was rubbed over the 
10 back of the pharynx 2-3 times. The resulting cells were dislodged firom brushes or 

applicators into 2 ml of sterile PBS. Biopsies of the medial turbinate were performed using 

cupped forceps under direct endoscopic control. 

Animals were evaluated daily for evidence of abnormal behavior of physical signs. A 

record of food and fluid intake was used to assess appetite and general health. Stool 
1 5 consistency was also recorded to check for the possibility of diarrhea. At each of the 

evaluation time points, rectal temperature, respiratory rate, and heart rate were measured. 

The nasal mucosa, conjunctivas, and pharynx were visually inspected. The monkeys were 

also examined for lymphadenopathy. 

Venous blood firom the monkeys was collected by standard venipuncture technique. 
20 Blood/serum analysis was performed in the clinical laboratory of the University of Iowa 

Hospitals and Clmics using a Hitachi 737 automated chemistry analyzer and a Technicom H6 

automated hematology analyzer. 

SCTology 

25 Sera were obtained and anti-adenoviral antibody titers were measured by an enzyme- 

linked immunoadsorbant assay (ELISA). For the ELISA, 50 ng/well of filled adenovirus 
(Lee Biomolecular Research Laboratories, San Diego, Ca) in 0.1 M NaHC03 coated on 
96 well plates at 4°C overnight. The test samples at appropriate dilutions were added, 
starting at a dilution of 1/50. The samples were incubated for 1 hour, the plates washed, and 

30 a goat anti-human IgG HRP conjugate (Jackson ImmunoResearch Laboratories, West Grove, 
PA) was added and incubated for 1 hour. The plates were washed and O-Phenylenediamine 
(Sigma Chemical Co., St. Louis, MO) was added for 30 min. at room temperature. The assay 
was stopped with 4.5 M H2SO4 and read at 490 nm on a Molecular Devices microplate 
reader. The titer was calculated as the product of the reciprocal of the initial dilution and the 

35 reciprocal of the dilution in the last well with an OD>0. 1 00. 

Neutralizing antibodies measiire the ability of the monkey serum to prevent infection 
of 293 cells by adenovirus. Monkey serum (1:25 dilution) [or nasal washings (1 :2 dilutions)] 
was added in two-fold serial dilutions to a 96 well plate. Adenovirus (2.5 x 10^ pfu) was 
added and incubated for 1 hour at 37'*C. The 293 cells were then added to all wells and the 
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plates were incubated until the serum-free control wells exhibited >9S% cytopathic effect. 
The titer was calculated as the product of the reciprocal of the initial dilution times the 
reciprocal of the dilution in the last well showing >95% cytopathic effect. 

5 Brpnchoalvgp^w lavage md nas^ brushmgs for cytology 

Bronchoalveolar lavage (BAL) was performed by cannulating the trachea with a 
silastic catheter and injectmg S ml of PBS. Gentle suction was applied to recover the fluid. 
The BAL sample was spun at SOOO rpm for 5 min. and cells were resuspended in 293 media 
at a concentration of 10^ cells/mL Cells were obtained from the monkey's nasal epithelium 
10 by gently rubbing the nasal mucosa for about 3 sec. with a cjrtobrush. The resulting cells 
were dislodged from the brushes mto 2 ml of PBS. Forty microliters of the cell suspension 
were cytocentrifuged onto slides and stained with Wright's stain. Samples were examined by 
light microscopy. 

15 

Histology of lung sections and nasal biopsies 

The right lung of each cotton rat was removed, inflated with 4% formaldehyde, and 
embedded in paraffin for sectioning. Nasal biopsies from the monkeys were also fixed with 
4% formaldehyde. Histologic sections were stained with hematoxylin and eosin (H&E), 
20 Sections were reviewed by at least one of the study personnel and by a pathologist who was 
unaware of the treatment each rat received. 



Immwnopytochgmiistry 

Pieces of limg and trachea of the cotton rats and nasal biopsies were frozen in liquid 
25 nitrogen on O.C.T. compound. Cryosections and paraffin sections of the specimens were 
used for immimofluorescence microscopy. Cytospin slides of nasal brushings were prepared 
on gelatin coated slides and fixed with paraformaldehyde. The tissue was permeabilized with 
Triton X-100, then a pool of monoclonal antibodies to CFTR (M13-1, Ml-4) (Denning, G.M. 
et al. (1992) J. Clin. Invest. 89:339-349) was added and mcubated for 12 hours. The primary 
. 30 antibody was removed and an anti-mouse biotinylated antibody (Biomeda, Foster City, CA) 
was added. After removal of the secondary antibody, streptavidin FITC (Biomeda, Foster 
City, Ca) was added and the slides were observed under a laser scanning confocal 
microscope. Both control animal samples and non-immime IgG stained samples were used as 
controls. 

35 

PGR 

PCR was performed on pieces of small bowel, brain, heart, kidney, liver, ovaries, and 
spleen from cotton rats. Approximately 1 g of the rat organs was mechanically ground and 
mixed with 50 ill sterile water, boiled for 5 min., and centrifiiged. A 5 iil aliquot of the 
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supematant was removed for further analysis. Monkey nasal brushings suspensions were 
also used for PGR. 

Nested PGR primer sets were designed to selectively amplify Ad2/GFTR-1 DNA over 
endogenous GFTR by placing one primer from each set in the adenovirus sequence and the 
5 other primer in the GFTR sequence. The first primer set amplifies a 723 bp firagment and is 
shown below: 

Ad2 5' AGT GTT GAG TGG GAG GGA GTA GAG TTT TGT GGT GGG 3* (SEQ ID 
N0:4) 

GFTR 5' GGA AAG GAG GGA TGG AGA GGA AAT GTG GG 3' (SEQ ID N0:5) 
10 The nested primer set amplifies a 506 bp fiagment and is shown below: 
Ad2 5' GTG GTG GGA GGG GGT GGG AGG TAG 3' (SEQ ID N0:6) 
GFTR 5' GGA AAA ATG GGT GGG TGT AGG AGG AGT GTG G 3' (SEQ ID N0:7) 

A PGR reaction mfac containing lOmM Tris-Gl (pH 8.3), 50mM KGl, 1 .5 mM MgGl2, 
0.001% (w/v) gelatin, 400 fiM each dNTP, 0.6 ^iM each primer (first set), and 2.5 units 
1 5 AmpliTaq (Perkin Ekner) was aliquoted into separate tubes. A 5 ^1 aliquot of each sample 
prep was then added and the mixture was overlaid with 50 ^1 of light mineral oil. The 
samples were processed on a Bamstead/Thennolyne (Dubuque, lA) thermal cycler 
programmed for 1 mm. at 94'*G, 1 mm. at 65**G, and 2 mm. at 72**C for 40 cycles. Post-run 
dwell was for 7 min. at 72°G. A 5 ^1 aliquot was removed and adde^ to a second PGR 
20 reaction using the nested set of primers and cycled as above. A 1 0 jxl aliquot of the final 
amplification reaction was aimlyzed on a 1% agarose gel and visualized with ethidium 
bromide. 

To determine the sensitivity of this procedure, a PGR mix containing control rat liver 
supematant was aliquoted into several tubes and spiked with dilutions of Ad2/GFTR-1 • 
25 Following the amplification protocols described above, it was determined that the nested 
PGR procedure could detect as little as 50 pfu of viral DNA. 

RT-PCR 

RT-PGR was used to detect vector-generated mRNA in cotton rat lung tissue and 
. 30 samples fi*om nasal brushings fi-om monkeys. A 200 jil aliquot of guanidine isothiocyanate 
solution (4 M guanidine isothiocyanate, 25 mM sodium citrate pH 7.0, 0.5% sarcosyl, and 0.1 
M P-mercaptoethanol) was added to a fi-ozen section of each lung and pellet from nasal 
brushings and the tissue was mechanically groxmd. Total RNA was isolated utilizing a 
single-step method (Chomczynski, P. and Sacchi, N. et al. (1987) Analytical Biochemistry 
35 162:156-159; Hanson, C.A. et al. (1990) Am, J. Pathol 137:1-6). The RNA was mcubated 
with 1 unit RQl RNase-free DNase (Promega Corp., Madison WI)) at 37°C for 20 min., 
denatured at 99**C for 5 min., precipitated with ammonium acetate and ethanol, and 
redissolved in 4 ^1 diethylpyrocarbonate treated water containing 20 vmits RNase Block 1 
(Stratagene, La JoUa CA). A 2 ^1 aliquot of the purified RNA was reverse transcribed using 
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the GeneAmp RNA PGR kit (Perkin Elmer Cetus) and the downstream primer from the first 
primer set described in the previous section. Reverse transcriptase was omitted from the 
reaction with the remaining 2 |il of the purified RNA prep, as a control in which preparations 
(both +/- RT) were then amplified using nested primer sets and the PGR protocols described 
5 above. A 10 ^l aliquot of the final amplification reaction was analyzed on a 1% agarose gel 
and visualized with ethidium bromide. 



Soythgm analy{?is. 

To verify the identity of the PGR products. Southern analysis was performed. The 
1 0 DNA was transferred to a nylon membrane as described (Sambrook si al., supra). A 
fragment of GFTR cDNA (ammo acids #1-525) was labeled with [32p].dCTP (IGN 
Biomedicals, Inc. Irvine GA) using an oligolabelmg kit (Pharmacia, Piscataway, NJ) and 
purified over a NIGK column (Pharmacia Piscataway, NJ) for use as a hybridization probe. 
The labeled probe was denatured, cooled, and incubated with the prehybridized filter for 1 5 
15 hours at 42'*G. The hybridized filter was then exposed td fihn (Kodak XAR-5) for 10 min. 



Culture of Ad2/CFTR-1 

20 Viral cultures were performed on the permissive 293 cell line. For culture of virus 

from lung tissue, 1 g of lung was frozen/thawed 3-6 times and then mechanically dismpted in 
200 ^il of 293 media. For culture of BAL and monkey nasal brushings, the cell suspension 
was spun for 5 min and the supernatant was collected. Fifty \i\ of the supernatant was added 
in duplicate to 293 cells grown in 96 well plates at 50% confluence. The 293 cells were 

25 incubated for 72 hr at 37°G, then fixed with a mixture of equal parts of methanol and acetone 
for 10 min. and incubated with FITG-labeled anti-adenovirus monoclonal antibodies 
(Ghemicon, Light Diagnostics, Temecuca, GA) for 30 min. Positive nuclear 
immxmofluorescence was interpreted as positive culture. The sensitivity of the assay was 
evaluated by adding dilutions of Ad2/GFTR-1 to 50 ^1 of the lung homogenate from one of 
. 30 the control rats. Viral repliqation was detected when as little as 1 pfti was added. 



RESULTS 

Efficacv of Ad2/CFTR^1 in the lungs of cotton rats . 

To test the ability of Ad2/CFTR-1 to transfer GFTR cDNA to the mtrapuhnonary 
35 airway epithelium, several studies were performed. 4 x 10 pfii - lU of Ad2/GFTR-1 in 100 |il 
was adminstered to seven cotton rats; three control rats received 100 |il of TBS (the vehicle 
for the virus). The rats were sacrificed 4, 1 0 or 14 days later. To detect viral transcripts 
encoding GFTR, reverse transcriptase was used to prepare cDNA from lung homogenates. 
The cDNA was amplified with PGR using primers that span adenovirus and GFTR-encoded 
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sequences. Thus, the procedure did not detect endogenous rat CFTR. Figure 16 shows that 
the lungs of animals which received Ad2/CFTR-1 were positive for virally-encoded CFTR 
mRNA. The lungs of all control rats were negative. 

To detect the protem, lung sections were immunostained with antibodies specific to 
5 CFTR. CFTR was detected at the apical membrane of bronchial epithelium from all rats 
exposed to Ad2/CFTR-1, but not from control rats. The location of recombinant CFTR at the 
apical membrane is consistent with the location of endogenous CFTR in human auway 
epithelium. Recombinant CFTR was detected above background levels because endogenous 
levels of CFTR in airway epithelia are very low and thus, difiScult to detect by 
10 unmunocytochemistry (Trapnell, B. et al. (1991) Proc. Natl Acad Set USA 88:6565-6569; 
Penning, G.M. etal. (1992) Cell Biol 118:551-59). 

These results show that Ad2/CFTR-1 directs the expression of CFTR mRNA in the 
lung of the cotton rat and CFTR protein in the intrapulmonary airways. 

15 Safety of Ad2/CFTR-1 in cotton rats- 

Because the El region of Ad2 is deleted in the Ad2/CFTR-1 virus, the vector was 
expected to be replication-impaired (Berkner, K.L. (1988) BioTechniques 6:616-629) and that 
it would be unable to shut off host cell protein synthesis (Basuss, L.E. et al. (1989) J. Virol 
50:202-212). Previous in vitro studies have suggested that this is the case in a variety of cells 

20 including primary cultures of human airway epithelial cells (Rich, D.P. et al. (1993) Human 
Gene Therapy 4:461-476). However, it is important to confirm this in vivo in the cotton rat, 
which is the most permissive animal model for human adenovirus infection (Ginsberg, H.S. 
et al. (1989) Proc. Natl Acad. Scl USA 86:3823-3827; Prince, G.A. et al. (1993) J, Virol 
67:101-1 1 1). Although dose of virus of 4.1 x lO^^ pfus per kg was used, none of the rats 

25 died. More importantly, extracts &om lung homogenates from each of the cotton rats were 
cultured in the permissive 293 cell line. With this assay 1 pfii of recombinant virus was 
detected in lung homogenate. However, virus was not detected by culture in the lungs of any 
of the treated animals. Thus, the virus did not appear to replicate in vivo. 

It is also possible that administration of Ad2/CFTR-1 could cause an inflammatory 

. 30 response, either due to a direct effect of the vuiis or as a result of administration of viral 
particles. Several studies were performed to test this possibility. None of the rats had a 
change in the total or differential white blood cell count, suggesting that there was no major 
systemic inflammatory response. To assess the pulmonary inflammatory response more 
directly, bronchoalveolar lavage was performed on each of the rats (Figures 17A and 17B). 

35 Figure 1 7A shows that there was no change in the total nimiber of cells recovered from the 
lavage or in the differential cell count. 

Sections of the lung stained by H&E were also prepared. There was no evidence of 
viral inclusions or any other changes characteristic of adenoviral infection (Prince, G. A. et al. 
(1993) y. Virol 67:101-1 1 1). When coded lung sections were evaluated by a skilled reader 
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who was unaware of which sections were treated, she was unable to distinguish between 
sections from the treated and untreated lungs. 

It seemed possible that the recombinant adenovirus could escape from the lung into 
other tissues. To test for this possibility, other organs from the rats were evaluated using 
S nested PGR to detect viral DNA. All organs tested from infected rats were negative, v^dth the 
exception of small bowel which was positive in 3 of 7 rats. Figure 1 8 shows the results of 2 
infected rats and one control rat sacrificed on day 4 after infection. The organ homogenates 
from the infected rats sacrificed were negative for Ad2/CFTR-1 with the exception of the 
small bowel. Organ homogenates from control rats sacrificed on day 4 after infection were 

10 negative for Ad2/CFTR-1 . The presence of viral DNA in the small bowel suggests that the 
rats may have swallowed some of the virus at the time of instillation or, alternatively, the 
normal auivay clearance mechanisms may have resulted in deposition of vural DNA in the 
gastrointestinal tract. Despite the presence of viral DNA in homogenates of small intestine, 
none of the rats developed diarrhea. This result suggests that if the virus expressed CFTR in 

15 the intestinal epithelium, there was no obvious adverse consequence. 

Repeat administration of Ad2/CFTR-1 to cotton rats 

Because adenovirus DNA integration into chromosomal DNA is not necessary for 
gene expression and only occurs at very low frequency, expression following any given 

20 treatment was anticipated to be fmite and that repeated administration of recombinant 

adenovirus would be required for treatment of CF airway disease. Therefore, the effect of 
repeated administration of Ad2/CFTR-1 cotton rats was examined. Twelve cotton rats 
received 50 ^1 of Ad2/CFTR-1. Two weeks later, 9 of the rats received a second dose of 50 |i 
1 of Ad2/CFTR-1 and 3 rats received 50 ^l of TBS. Rats were sacrificed on day 3, 7, or 14 

25 after virus administration. At the time of the second vector administration all cotton rats had 
an increased antibody titer to adenovirus. 

After the second intrapulmonary administration of virus, none of the rats died. 
Moreover, the results of studies assessing safety and efficacy were similar to results obtained 
in animals receiving adenovirus for the first time. Viral cultures of rat lung homogenates on 
» 30 293 cells were negative at all time points, suggesting that there was no virus replication. 
There was no difference between treated and control rats in the total or differential white 
blood coimt at any of the time points. The lungs were evaluated by histologic sections 
stained with H&E; and found no observable differences between the control and treated rats 
when sections were read by us or by a blinded skilled reader. Examples of some sections are 

35 shown in Figure 1 9. When organs were examined for viral DNA using PGR, viral DNA was 
found only in the small intestine of 2 rats. Despite seropositivity of the rats at the time of the 
second administration, expression of CFTR (as assessed by RT-PCR and by 
immimocytochemistry of sections stained with CFTR antibodies) similar to that seen in 
animals that received a single administration was observed. 
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These results suggest that prior admimstration of Ad2/CFTR-1 and the development 
of an antibody response did not cause an inflammatory response in the rats nor did it prevent 
virus-dependent production of CFTR. 

5 Eviidence that AdZ/CFTR-l gxpresscs CFTR iin primate airway gpiithgliiwm 

The cells lining the respkatory tract and the immune system of primates are similar to 
those of humans. To test the ability of Ad2/CFTR-1 to transfer CFTR to the respiratory 
epithelium of primates, Ad2/CFTR was applied on three occasions as described in the 
methods to the nasal epithelium of three Rhesus monkeys. To obtain cells from the 
1 0 respiratory epithelium, the epithelium was brushed using a procedure sunilar to that used to 
sample the airway epithelium of humans during fiberoptic bronchoscopy. 

To assess gene transfer, RT-PCR was used as described above for the cotton rats. RT 
- PGR was positive on cells brushed from the right nostril of all three monkeys, although it 
was only detectable for 1 8 days after virus administration. An example of the results are 
1 S shown in Figure 20A. The presence of a positive reaction in cells from the left nostril most 
likely represents some virus movement to the left side due to drainage, or possibly from the 
monkey moving the virus from one nostril to the other with its fingers after it recovered from 
anesthesia. 

The specificity of the RT-PCR is shown in Figure 20B. A Southern blot with a probe 
20 to CFTR hybridized with the RT-PCR product from the monkey infected with Ad2/CFTR-1, 
As a control, one monkey received a dififerent virus (Ad2/pGal-l) which encodes p- 
galactosidase. When different primers were used to reverse transcribe the p-galactosidase 
mRNA and amplify the cDNA, the appropriate PCR product was detected. However, the 
PCR product did not hybridize to the CFTR probe on Southern blot. This result shows the 
25 specificity of the reaction for amplification of the adenovirus-directed CFTR transcript. 

The failure to detect evidence of adenovirus-encoded CFTR mRNA at 18 days or 
beyond suggests that the sensitivity of the RT-PCR may be low because of limited efficacy of 
the reverse transcriptase or because RNAses may have degraded RNA after cell acquisition. 
Viral DNA, however, was detected by PCR in brushings from the nasal epithelium for 
. 30 seventy days after application of the virus. This result indicates that although mRNA was not 
detected after 2 weeks, viral DNA was present for a prolonged period and may have been 
transcriptionally active. 

To assess the presence of CFTR proteins directly, cells obtained by brushing were 
plated onto slides by cytospin and stained with antibodies to CFTR. Figure 21 shows an 
35 example of the,immunocytochemistry of the brushed cells. A positive reaction is clearly 
evident in cells exposed to Ad2/CFTR-1 . The cells were scored as positive by 
inmivinocytochemistry when evaluated by a reader uninformed to the identity of the samples. 
Immunocytochemistry remained positive for five to six weeks for the three monkeys, even 
after the second administration of Ad2/CFTR-1 . On occasion, a few positive staining cells 
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were observed from the contralateral nostril of the monkeys. However, this was of short 
duration, lasting at most one week. 

Sections of nasal turbinate biopsies obtained within a week after the third infection 
were also examined. In sections from the control monkey, little if any immunofluorescence 
5 from the surface epithelium was observed, but the submucosal glands showed significant 
staining of CFTR (Fig. 22). These observations are consistent with .results of previous 
studies (Engelhardt, J.F. and Wilson, J.M. (1992) Nature Gen. 2:240-248.) In contrast, 
sections from monkeys that received Ad2/CFTR-1 revealed increased immimofluorescence at 
the apical membrane of the surface epithelium. The submucosal glands did not appear to 
10 have greater immunostraining than was observed under control conditions. These results 
indicate that Ad2/CFTR-1 can transfer the CFTR cDNA to the airway epithelium of Rhesus 
monkeys, even in seropositive animals (see below). 

Safety of Ad2/CFTR-1 administered to monkevs 

1 5 Figure 23 shows that all three treated monkeys developed antibodies against 

adenovmis. Antibody titers measured by ELISA rose within two weeks after the first 
infection. With subsequent infections the titer rose within days. The sentinel monkey had 
low antibody titers throughout the experiment. Tests for the presence of neutralizing 
antibodies were also performed. After the first administration, neutralizmg antibodies were 

20 not observed, but they were detected after the second administmtion and during the third viral 
administration (Fig. 23). 

To detect virus, supematants from nasal brushings and swabs were cultured on 293 
cells. All monkeys had positive cultures on day 1 and on day 3 or 4 from the infected nostril. 
Cultures remained positive in one of the monkeys at seven days after administration, but 

25 cultures were never positive beyond 7 days. Live virus was occasionally detected in swabs 
from the contra lateral nostril during the first 4 days after infection. The rapid loss of 
detectable virus suggests that there was not viral replication. Stools were routinely cultured, 
but virus was never detected in stools from any of the monkeys. 

None of the monkeys developed any clinical signs of viral infection or inflammation. 

30 Visual inspection of the nasal epithelium revealed slight erythema in all three monkeys in 
both nostrils on the first day after infection; but similar erythema was observed in the control 
monkey and likely resulted from the instrumentation. There was no visible abnormalities at 
days 3 or 4, or on weekly inspection thereafter. Physical examination revealed no fever, 
lymphadenopathy, conjunctivitis, tachypnea, or tachycardia at any of the time points. No 

35 abnormalities were found in a complete blood count or sedimentation rate, nor were 

abnormalities observed in serum electrolytes, transaminases, or blood urea nitrogen and 
creatinine. 

Examination of Wright-stained cells from the nasal brushings showed that neutrophils 
and lymphocytes accounted for less than 5% of total cells in all three monkeys. 
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Administration of the Ad2/CFTR-1 caused no change in the distribution or number of 
inflammatory cells at any of the time points following virus administration. H&E stains of 
the nasal turbinate biopsies specimens from the control monkey could not be differentiated 
from that of the experimental monkey when the specimens were reviewed by an independent 
5 pathologist. (Fig. 24) 

These results demonstrate the ability of a recombinant adenovirus encodmg CFTR 
(Ad2/CFTR-1) to e>q)ress CFTR cDNA in the airway epithelium of cotton rats and monkeys 
during repeated administration. They also indicate that application of the virus involves little 
if any risk. Thus, they suggest that such a vector may be of value in expressing CFTR in the 

1 0 airway epithelium of humans with cystic fibrosis. 

Two methods were used to show that Ad2/CFTR-1 expresses CFTR in the airway 
epithelium of cotton rats and primates: CFTR mRNA was detected using RT-PCR and 
protein was detected by immimocytochemistry. Duration of expression as assessed 
immunocytochemically was five to six weeks. Because very little protein is required to 

15 generate CI" secretion (Welsh, M.J. (1987) Physiol Rev. 67:1 143-1 184; Trapnell, B,C. et al. 

(1991) Proc. Natl Acad Set USA 88:6565-6569; Denning, GM. et al. (1992) J. Cell Biol 
1 18:551-559), it is likely that functional expression of CFTR persists substantially longer 
than the period of time during which CFTR was detected by immunocytochemistry. Support 
for this evidence comes fix)m two consderations: first, it is very difficult to detect CFTR 

20 immimcytochemically in the airway epithelixmi, yet the expression of an apical membrane 
CI" permeability due to the presence of CFTR CI" channels is readily detected. The ability 
of a minimal amount of CFTR to have important functional effects is likely a result of the 
fact that a single ion channel conducts a very large number of ions (10^ - 10^ ions/sec). 
Thus, ion channels are not usually abundant proteins in epithelia. Second, previous work 

25 suggests that the defective electrolyte transport of CF epithelia can be corrected when only 6- 
10% of cells in a CF airway epithelium overexpress wild-type CFTR (Johnson, L.G. et al. 

(1992) Nature Gen. 2:21-25). Thus, correction of the biologic defect in CF patients may be 
possible when only a small percent of the cells express CFTR. This is also consistent with 
our previous studies in vitro showing that Ad2/CFTR-1 at relatively low multiplicities of 

30 infection generated a cAMP-stimulated CI*" secretory response in CF epithelia (Rich, D.P. et 
al. (1993) Human Gene Therapy 4:461-476). 

This study also provides the first comprehensive data on the safety of adenovirus 
vectors for gene transfer to auway epithelium. Several aspects of the studies are 
encouraging. There was no evidence of viral replication, rather infectious viral particles were 

35 rapidly cleared fi*om both cotton rats and primates. These data, together with our previous in 
vitro studies, suggest that replication of recombinant virus in humans will likely not be a 
problem. The other major consideration for safety of an adenovirus vector in the treatment of 
CF is the possibility of an inflammatory response. The data indicate that the virus gerierated 
an antibody response in both cotton rats and monkeys. Despite this, no evidence of a 
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systemic or local inflammatory response was observed. The cells obtained by 
bronchoalveolar lavage and by brushing and swabs were not altered by virus application. 
Moreover, the histology of epithelia treated with adenovirus was indistinguishable from that 
of control epithelia. These data suggest that at least three sequential e?q)osures of airway 
5 epithelium to adenoviros does not cause a detrimental inflammatory response. 

These data suggest that Ad2/CFTR-1 can effectively transfer CFTR cDNA to airway 
epithelium and direct the expression of CFTR. They also suggest that transfer is relatively 
safe in animals. Thus, they suggest that Ad2/CFTR-1 may be a good vector for treating 
patients with CF. This was confirmed in the foUowmg example. 

10 

Example 1 0 - CFTR Gene Therapv in Nasal Epithelia from Human CF Subjects 
EXPERIMENTAL PROCEDURES 

15 A<ienovirtfs vectpr 

The recombinant adenovirus Ad2/CFTR-1 was used to deliver CFTR cDNA. The 
construction and preparation of Ad2/CFTR-1, and its use in vitro and in vivo in animals, has 
been previously described (Rich, D.P. et al. (1993) Human Gene Therapy 4:461-476; Zabner, 
J. et al. (1993) Nature Gen. (in press)). The DNA construct comprises a full length copy of 

20 the Ad2 genome from which the early region 1 genes (nucleotides 546 to 3497) have been 
replaced by cDNA for CFTR. The viral Ela promoter was used for CFTR cDNA; this is a 
low to moderate strength promoter. Termination/polyadenylation occurs at the site normally 
used by Elb and protein IX transcripts. The E3 region of the virus was conser\'ed. 

25 £ati£Qts 

Three patients with CF were studied. Genotype was determined by IG Labs 
(Framingham, MA). All three patients had mild CF as defined by an NIH score > 70 
(Taussig, L.M, et al. (1973) J. Pediatr. 82:380-390), a normal weight for height ratio, a 
forced expiratory volume in one second (FEVl) greater than 50% of predicted and an arterial 
. 30 PO2 greater than 72. All patients were seropositive for type 2 adenovirus, and had no recent 
viral illnesses. Pretreatment cultures of nasal swabs, pharyngeal swabs, sputum, urine, stool, 
and blood leukocytes were negative for adenovirus. PCR of pretreatment nasal brushings 
using primers for the adenovirus El region were negative. Patients were evaluated at least 
twice by FEVl, cytology of nasal mucosa, visual inspection, and measurement of before 
35 treatment. Prior to treatment, a coronal computed tomographic scan of the paranasal sinuses 
and a chest X-ray were obtained. 

The first patient was a 21 year old woman who was diagnosed at 3 months after birth. 
She had pancreatic insufficiency, a positive sweat chloride test (101 mEq/l), and is 
homozygous for the AF508 mutation. Her NIH score was 90 and her FEVl was 83% 
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predicted. The second patient was a 36 year old man who was diagnosed at the age of 1 3 
when he presented with symptoms of pancreatic insufficiency. A sweat chloride test revealed 
a chloride concentration of 70 mEq/1. He is a heterozygote with the AFS08 and GSSID 
mutations. His NIH score was 88 and his FEVI was 66% predicted. The third patient was a 
S SO year old woman, diagnosed at the age of 9 with a positive sweat chloride test (1 04 mEq/1 ). 
She has pancreatic insufficiency and insulin dependent diabetes mellitus. She is homozygous 
for the AF508 mutation. Her NIH score was 73 and her FEVI was 65% predicted. 

Transepithelial voltage 

10 The transepithelial electric potential difference across the nasal epithelium was 

measured using techniques similar to those previously described (Alton, E. W.F. W. et al 
(1987) Thorax 42:815-817; Knowles, M. et al. (1981) K Eng. 1 Med. 305:1489-1495). A 23 
gauge subcutaneous needle connected with sterile normal saline solution to a silver/silver 
chloride pellet (E. W, Wright, Guilford, CT) was used as a reference electrode. The exploring 

1 5 electrode was a size 8 rubber catheter (modified Argyle^ Foley catheter, St Louis, MO) with 
one side hole at the tip. The catheter was filled with Ringer's solution containing (in mM), 
135 NaCl, 2.4 KH2PO2, K2HPO4, L2CaCL2, 1.2 MgCl2 and 10 Hepes (titrated to pH 7.4 
with NaOH) and was connected to a silver/silver chloride pellet. Voltage was measured with 
a voltmeter (Keithley Instruments Inc., Cleveland, OH) connected to a strip chart recorder 

20 (Servocorder, Watanabe Instruments, Japan). Prior to the measurements, the silver/silver 
chloride pellets were connected in series with the Ringer's solution; the pellets were changed 
if the recorded was greater than ±4 mV. The mbber catheter was mtroduced into the 
nostril under telescopic guidance (Hopkins Telescope, Karl Storz, Tuttlingen West Germany) 
and the side hole of the catheter was placed next to the study area in the medical aspect of the 

25 inferior nasal turbinate. The distance fix>m the anterior tip of the inferior turbinate and the 
spatial relationship with the medial turbinate, the maxillary sinus ostium, and in one patient a 
small polyp, were used to locate the area of Ad2/CFTR-1 administration for measurements. 
Photographs and video recorder images were also used. Basal was recorded until no 
changes in were observed after slow intermittent 100 ^l/nun infusion of tiie Ringer's 
. 30 solution. Once a stable baseline was achieved, 200 |il of a Ringer's solution containing 100 
M amiloride (Merck and Co. Inc., West Point, PA) was instilled through the catheter and 
changes in were recorded until no further change were observed after intermittent 
instillations. Finally, 200 \i\ Ringer's solution containing 100 |aM amiloride plus 10 fiM 
terbutaline (Geigy Pharmaceuticals, Ardsley, NY) was instilled and the changes in were 

35 recorded. 

Measurements of basal were reproducible over time: in the three treated patients, 
the coefficients of variation before administration of Ad2/CFTR-1 were 3.6%, 12%, and 
12%. The changes induced by terbutaline were also reproducible. In 30 measurements in 9 
CF patients, the terbutaline-induced changes in Vj (AV^) ranged from 0 mV to +4 mV; 
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hyperpolarization of was never observed. In contrast, in 7 normal subjects AVt ranged 
from -1 mV to -5 mV; hypeipolarization was always observed. 

Ad2/CFTR-1 application and cell acquisition 
5 The patients were taken to the operating room and monitoring was conmienced using 

continuous EKG and pulse oximetry recording as well as automatic intemiittent blood 
pressure measurement. After mild sedation, the nasal mucosa was anesthetized by atomizing 
0.5 ml of 5% cocaine. The mucosa in the area of the inferior turbinate was then packed with 
cotton pledgets previously soaked in a mixture of 2 ml of 0.1% adrenaline and 8 ml of 1% 

10 tetracaine. The pledgets remained in place for 10-40 min. Using endoscopic visualization 
with a television monitoring system, the ^plicator was introduced through the nostril and 
positioned on the medial aspect of the inferior turbinate, at least three centimeters from its 
anterior tip (Figures 25A-25I). The viral suspension was inftised into the applicator through 
connecting catheters. Hie position of the applicator was monitored endoscopically to ensure 

1 S that it did not move and that enough pressure was applied to prevent leakage. After the virus 
was in contact with the nasal epithelium for thirty minutes, the viral suspension was removed, 
and the applicator was withdrawn. In the third patient's right nasal cavity, the virus was 
applied using the modified Foley catheter used for Vt measurements. The catheter was 
introduced without anesthetic under endoscopic guidance until the side hole of the catheter 

20 was in contact with the area of interest in the inferior turbinate. The viral solution was 

inftised slowly until a drop of solution was seen with the telescope. The catheter was left in 
place for thirty minutes and then removed. 

Cells were obtained from the area of virus administration approximately 2 weeks 
before treatment and then at weekly intervals after treatment. The inferior turbinate was 

25 packed for 10 minutes with cotton pledgets previously soaked in 1 ml of 5% cocaine. Under 
endoscopic control, the area of administration was gently brushed for 5 seconds. The brushed 
cells were dislodged in PBS. Swabs of the nasal epithelia were collected using cotton tipped 
applicators without anesthesia. Cytospin slides were prepared and stained with Wrighfs 
stain. Light microscopy was used to assess the respiratory epithelial cells and inflammatory 

30 cells. For biopsies, sedatives/anesthesia was administered as described for the application 
procedure. After endoscopic inspection, and identification of the site to be biopsied, the 
submucosa was injected with 1% xylocaine, with 1/100,000 epinephrine. The area of virus 
application on the inferior turbinate was removed. The specimen was fixed in 4% 
formaldehyde and stained. 

35 

RESULTS 

On day one after Ad2/CFTR-1 administration and at all subsequent time points, 
Ad2/CFTR-1 from the nasal epithelium, pharynx, blood, urine, or stool could not be cultured. 
As a control for the sensitivity of the culture assay, samples were routinely spiked with 10 
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and 100 lU Ad2/CFTR-1 . In every case, the spiked samples were positive, indicating that, at 
a minimum, 10 lU of Ad2/CFTR should have been detected. No evidence of a systemic 
response as assessed by history, physical examination, serum chemistries or cell counts, chest 
and sinus X-rays, pulmonary function tests, or arterial blood gases performed before and after 
5 Ad2/CFTR-1 administration. An increase in antibodies to adenovirus was not detectable by 
ELISA or by neutralization for 35 days after treatment. 

Three to four hours after Ad2/CFTR-1 administration, at the time that local anesthesia 
and localized vasoconstriction abated, all patients began to complain of nasal congestion and 
in one case, mild rhinorrhea. These were isolated symptoms that duninished by 1 8 hours and 

1 0 resolved by 28 to 42 hours. Inspection of the nasal mucosa showed mild to moderate 

erythema, edema, and exudate (Figures 25A-25C). These physical findmgs followed a time 
course similar to the symptoms. The physical findings were not limited to the site of virus 
application, even though preliminary studies using the applicator showed that marker 
methylene blue was lunited to the area of application. In two additional patients with CF, the 

1 5 identical anesthesia and application procedure were used, but saline was applied instead of 
vhrus, yet the same symptoms and physical findings were observed in these patients (Figures 
2SG-25I). Moreover, the local anesthesia and vasoconstriction generated similar changes 
even when the applicator was not used, suggesting that the anesthesia/vasoconstriction caused 
some, if not all the mjury. Twenty-four hours after the application procedure, analysis of 

20 ceils removed from nasal swabs revealed an equivalent increase in the percent neutrophils in 
patients treated with Ad2/CFTR-1 or with salme. One week after application, the 
neutrophilia had resolved in both groups. Respiratory epithelial cells obtained by nasal 
brushing appeared normal at one week and at subsequent time points, and showed no 
evidence of inclusion bodies. To further evaluate the mucosa, the epithelium was biopsied on 

25 day three in the &st patient and day one in the second patient. Independent evaluation by 
two pathologists not otherwise associated with the study suggested changes consistent with 
mild trauma and possible ischemia (probably secondary to the anesthetic/vasoconstrictors 
used before virus administmtion), but there were no abnormalities suggestive of virus- 
mediated damage. 

30 Because the application procedure produced some mild mjury in the first two patients, 

the method of administration was altered in the third patient. The method used did not 
require the use of local anesthesia or vasoconstriction and which was thus less likely to cause 
injury, but which was also less certain in its ability to constrain Ad2/CFTR-1 in a precisely 
defined area. On the right side, Ad2/CFTR-1 was administered as in the first two patients, 

35 and on the left side, the virus was administered without anesthesia or the applicator, instead 
using a small Foley catheter to apply and maintain Ad2/CFTR-1 in a relatively defined area 
by surface tension (Figure 25E). On the right side, the symptoms and physical findings were 
the same as those observed in the first two patients. By contrast, on the left side there were 
no symptoms and on inspection the nasal mucosa appeared normal (Figures 25D-25F). Nasal 
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swabs obtained from the right side showed neutrophilia similar to that observed in the first 
two patients. In contrast, the left side which had no anesthesia and minimal manipulation, 
did not develop neutrophilia. Biopsy of the left side on day 3 after administration (Figure 
26), showed morphology consistent with CF- a thickened basement membrane and 
5 occasional polymorphonuclear cells in the submucosa- but no abnormalities that could be 
attributed to the adenovirus vector. 

The first patient developed symptoms of a sore throat and increased cough that began 
three weeks after treatment and persisted for two days. Six weeks after treatment she 
developed an exacerbation of her bronchitis/bronchiectasis and hemoptysis that required 

1 0 hospitalization. The second patient had a transient episode of minimal hemoptysis three 
weeks after treatment; it was not accompanied by any other symptoms before or after the 
episode. The third patient has an exacerbation of bronchitis three weeks after treatment for 
which she was given oral antibiotics. Based on each patient's pretreatment clinical history, 
evaluation of the episodes, and viral cultures, no evidence could be discerned that linked 

1 5 these episodes to administration of Ad2/CFTR- 1 . Rather the episodes appeared consistent 
with the norma] course of disease in each individual. 

The loss of CFTR CI" channel fimction causes abnormal ion transport across affected 
epithelia, which in tum contributes to the pathogenesis of CF-associated airway disease 
(Boat, T.F. et al. in The Metabolic Basis of Inherited Diseases (Scriver, C.R. et al. eds., 

20 McGraw-Hill, New York (1989); Quinton, P.M. (1990) FASEB J. 4:2709-2717). In airway 
epithelia, ion transport is dominated by two electrically conductive processes: amiloride- 
sensitive absorption of Na"^ from the mucosal to the submucosal surface and cAMP- 
stimulated CI" secretion in the opposite direction. (Quinton, P.M. (1990) FASEB J. 4:2709- 
2717; Welsh, M.J. (1987) Physiol. Rev. 67:1 143-1 184). These two transport processes can be 

25 assessed noninvasively by measuring the voltage across the nasal epithelium (Vt) in vivo 
(Knowles, M. et al (1981) K Eng. J. Med 305:1489-1495; Alton, E.W.F.W. et al.(1987) 
Thorax 42:815-817). Figure 27 shows an example from a normal subject. Under basal 
conditions, was electrically negative (lumen referenced to the submucosal surface). 
Perfiision of amiloride (100 |iM) onto the mucosal surface inhibited by blocking apical 

30 Na+ channels (Knowles, M. et al (1981) N. Eng. J. Med. 305:1489-1495; Qumton, P.M. 
(1990) FASEB J. 4:2709-2717; Welsh, M.J. (1992) Neuron 8:821-829). Subsequent 
perfiision of terbutaline (10 |iM) a p-adrenergic agonist, hyperpolarized by increasing 
cellular levels of cAMP, opening CFTR CI" channels, and stimulating chloride secretion 
(Quinton, P.M. (1990) FASEB J. 4:2709-2717; Welsh, M.J. et al. (1992) Neuron 8:821-829). 

35 Figure 28A shows results from seven normal subjects: basal was -10.5 ± l.OmV, and in 
the presence of amiloride, terbutaline hyperpolarized Vt by -2.3 ± 0.5mV. 

In patients wdth CF, was more electrically negative than in normal subjects (Figure 
28B), as has been previously reported (Knowles, M. et al. (1981) N. Eng. J. Med 305:1489- 
1495). Basal Vt was -37.0 ± 2.4 mV, much more negative than values in normal subjects (P< 
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0.001). (Note the difference in scale in Figure 28A and Figure 28B). Amiloride inhibited V^, 
as it did in normal subjects. However, Vt failed to hyperpolarize when terbutaline was 
perfused onto the epithelium in the presence of amiloride. Instead, Vt either did not change 
or became less negative: on average depolarized by +1 .8 ± 0.6 mV, a result very different 
5 from that observed in normal subjects. (P<0.001). 

After Ad2/CFTR-1 was applied, basal Vt became less negative in all three CF 
patients: Figure 29A shows an example &om the third patient before (Figure 29A) and after 
(Figure 29B) treatment and Figures 30A, 30C, and 30E show the time course of changes m 
basal Vt for all three patients. The decrease in basal Vt suggests that application of 

10 Ad2/CFTR-1 corrected the CF electolyte transport defect in nasal epithelium of all three 
patients. Additional evidence came from an examination of the response to terbutaline. 
Figure BOB shows that in contrast to the response before Ad2/CFTR-1 was applied, after 
virus replication, in the presence of amiloride, terbutaline stimulated Vt- Figures SOB, SOD, 
and SOF show the time course of the response. These data indicate that Ad2/CFTR-1 

1 5 corrected the CF defect in CI" transport. Correction of the CI" transport defect cannot be 
attributed to the anesthesia/application procedure because it did not occur in patients treated 
with saline instead of Ad2/CFTR-1 (Figure 31). Moreover, the effects of the anesthesia were 
generalized on the nasal mucosa, but basal Vt decreased only in the area of virus 
administration. Finally, similar changes were observed in the left nasal mucosa of the third 

20 patient (Figures SOE and SOF), which had no symptomatic or physical response after the 
modified application procedure. 

Unsuccessful attempts were made to detect CFTR transcripts by reverse transciptase- 
PCR and by immunocytochemistry in cells from nasal brushings and biopsies. Although 
similar studies in animals have been successfiil (Zabner, J. et al. (1993) Nature Gen, (in 

25 press)), those studies used much higher doses of Ad2/CFTR-L The lack of success in the 
present case likely reflects the small amount of available tissue, the low MOI, the fact that 
only a fraction of cells may have been corrected, and the fact that Ad2/CFTR-1 contains a 
low to moderate strength promoter (Ela) which produces much less mRNA and protein than 
comparable constructs using a much stronger CMV promoter (unpublished observation). The 
'30 Ela promoter was chosen because CFTR normally expressed at very low levels in airway 
epithelial cells (Trapnell, B.C. et al. (1991) Proc. Natl Acad Set USA 88:6565-6569). It is 
also difficult to detect CFTR protein and mRNA in normal human airway epithelia, although 
function is readily detected because a single ion channel can conduct a very large number of 
ions per second and thus efficiently support C 1 " transport. 

35 With time, the electrical changes that indicate correction of the CF defect reverted 

toward pretreatment values. However, the basal Vt appeared to revert more slowly than did 
the change in Vt produced by terbutaline. The significance of this difference is \mknown, but 
it may reflect the relative sensitivity of the two measurements to expression of normal CFTR. 
In any case, this study was not designed to test the duration of correction because the treated 



-51 - 

area was removed by biopsy on one side and the nasal, mucosa on the other side was brushed 
to obtain cells for analysis at 7 to 10 days after virus administration, and then at 
approximately weekly intervals. Brushmg the mucosa removes cells, disrupts the epithelium, 
and reduces basal to zero for at least two days afterwards, thus preventing an accurate 
5 assessment of duration of the effect of Ad2/CFTR-1 . 

EflTiPacy of adenpvin^-mcdiifitgd gmg trmsfert 

The major conclusion of this study is that in vivo application of a recombinant 
adenovirus encoding CFTR can correct the defect in auway epithelial CI" transport that is 

1 0 characteristic of CF epithelia. 

Complementation of the CI' channel defect in human nasal epithelium could be 
measured as a change in basal yoltage and as a change in the response to cAMP agonists. 
Although the protocol was not designed to establish duration, changes in these parameters 
were detected for at least three weeks. These results rq)resent the first report that 

1 5 administration of a recombinant adenovirus to humans can correct a genetic lesion as 
measured by a functional assay. This study contrasts with most earlier attempts at gene 
transfer to humans, in that a recombinant viral vector was administered directly to humans, 
' rather than using a in vitro protocol involving removal of cells fi'om the patient, transduction 
of the cells in culture, followed by remtroduction of the cells into the patient. 

20 Evidence that the CF C 1 " transport defect was corrected at all three doses of virus, 

corresponding to 1, 3, and 25 MOI, was obtained. This result is consistent with earlier 
studies showing that similar MOIs reversed the CF fluid and electrolyte transport defects in 
primary cultures of CF airway cells grown as epithelia on permeable filter supports (Rich, 
D.P. et al. (1993) Human Gene Therapy 4:461-476 and Zabner et al. submitted for 

25 publication): at an MOI of less than 1, cAMP-stimulated CI" secretion was partially restored, 
and after treatment with 1 MOI Ad2/CFTR-1 cAMP agonists stimulated fluid secretion that 
was within the range observed in epithelia firom normal subjects. At an MOI of 1, a related 
adenovirus vector produced p-galactosidase activity in 20% of infected epithelial cells as 
assessed by fluorescence-activated cell analysis (Zabner et al. submitted for publication). 

30 Such data would imply that pharmacologic dose of adenovirus in CF airways might 

correspond to an MOI of one. If it is estimated that there are 2x1 0^ cells/cm^ in the airway 
(Mariassy, A.T. in Comparative Biology of the Normal Lung (CRC Press, Boca Raton 1992), 
and that the airways from the trachea to the respiratory bronchioles have a surface area of 
1400 cm2 (Weibel, E.R. Morphometry of the Human Lung (Springer Verlag, Heidelberg, 

35 1963) then there would be approximately 3x1 0^ potential target cells. Assuming a particle to 
lU ratio of 100, this would correspond to approximately 3x1 0^ 1 particles of adenovirus with 
a mass of approximately 75 |ig. While obviously only a crude estimate, such information is 
usefiil in designing animal experiments to establish the likely safety profile of a human dose. 
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It is possible that an efficacious MOI of recombinant adenovirus could be less than 
the lowest MOI tested here. Some evidence suggests that not all cells in an epithelial 
monolayer need to express CFTR to correct the CF electrolyte transport defects. Mixing 
experiments showed that when perhaps 5-10% of cells overexpress CFTR, the monolayer 
5 exhibits v^ld-type electrical properties (Johnson, L.G. et al (1992) Nature Gen, 2:21-25). 
Studies usmg liposomes to express CFTR in mice bearing a disrupted CFTR gene also 
suggest that only a small proportion of cells need to be corrected (Hyde, S.C. et al. (1993) 
Nature 362:250-255). The results referred to above using airway epithelial monolayers and 
multiplicities of Ad2/CFTR-1 as low as 0.1 showed measurable changes in CI' secretion 
10 (Rich, D.P. et al. (1993) Human Gene Therapy 4:461-476 and Zabner et al. submitted for 
publication). 

Given the very high sensitivity of electrolyte transport assays (which result because a 
single CI" channel is capable of transporting large numbers of ions/sec) and the low activity 
of the Ela promoter used to transcribe CFTR, the inability to detect CFTR protein and CFTR 

1 5 mRNA are perhaps not surprising. Although CFTR mRNA could not be detected by reverse 
transcriptase-PCR, Ad2/CFTR-1 DNA could be detected in the samples by standard PCR , 
demonstrating the presence of input DNA and suggesting that the reverse transcriptase 
reaction may have been suboptimal. This could have occurred because of factors in the tissue 
that inhibit the reverse transcriptase. Although there is little doubt that the changes in 

20 electrolyte transport measured here result from expression of CFTR, it remains to be seen 
whether this will lead to measurable clinical changes in lung function. 

Safetv considerations. 

Application of the adenovirus vector to the nasal epithelium in these three patients 

25 was well-tolerated. Although mild inflammation was observed in the nasal epithelium of all 
three patients following administration of Ad2/CFTR-1, similar changes were observed in 
two volunteers who underwent a sham procedure using saline rather than the viral vector. 
Clearly a combination of anesthetic- and procedure-related trauma resulted in the changes in 
the nasal mucosa. There is insufficient evidence to conclude that no inflammation results 

30 from virus administration. However, using a modified administration of the highest MOI of 
virus tested (25 MOI) in one patient, no inflammation was observed under conditions that 
resulted in evidence of biophysical efficacy that lasted until the area was removed by biopsy 
at three days. 

There was no evidence of replication of Ad2/CFTR-1 . Earlier studies had established 
35 that replication of Ad2/CFTR-1 in tissue culture and experimental animals is severely 

impaired (Rich, D.P. et al. (1993) Human Gene Therapy 4:46U47 6; Zabner, J. et al. (1993) 
Nature Gen, (in press)). Replication only occurs in cells that supply the missing early 
proteins of the El region of adenovirus, such as 293 cells, or under conditions where the El 
region is provided by coinfcction with or recombination with an El-containing adenovirus 



-53- 

(Graham, FX. and Prevec, L. Vaccines: New Approaches to Immunological Problems (R.W. 
Ellis, ed., Boston, Butterworth-Heineraiann, 1992); Berkner, K.L. (1988) Biotechniques 
6:61 6-629). The patients studied here were seropositive for adenovirus types 2 and 5 prior to 
the study were negative for adenovirus upon culture of nasal swabs prior to administration of 
5 Ad2/CFTR-1, and were shown by PGR methods to lack endogenous El DNA sequences such 
as have been reported in some human subjects (Matsuse T. et al. (1992) Am, Rev. Respir. Dis. 
146:177-184). 

Exampk 1 \ ■ Cpffsfrwctipn and Pacl^aging of Psgwio Adenoviral Vector (PAY) 

10 With reference to Figure 32, the PAV construct was made by inserting the Ad2 

packaging signal and El enhancer region (0-358 nt) in Bluescript II SK- (Stratagene, LaJoUa, 
CA). A variation of this vector, known as PAV 11 was constructed similarly, except the Ad2 
packaging signal and El enhancer region contained 0-380 nt. The addition of nucleotides at 
the 5' end results in larger PAVs, which may be more efficiently packaged, yet would include 

1 5 more adenoviral sequences and therefore could potentially be more immunogenic or more 
capable of replicating. 

To allow ease of manipulation for either the insertion of gene coding regions or 
complete excision and use in transfections for the purpose of generating infectious particles, a 
complementary plasmid was also built in pBluescript SKII-. This complementary plasmid 

20 contains the Ad2 major late promoter (MLP) and tripartite leader (TPL) DNA and an SV40 
T-antigen nuclear localization signal (NLS) and polyadenylation signal (SVpA), As can be 
seen in Figure 32, this plasmid contains a convenient restriction site for the insertion of genes 
of interest between the MLP/TPL and SV40 poly A. This construct is engineered such that 
the entire cassette may be excised and inserted into the former PAV I or PAV II construct. 

25 Generation of PAV infectious particles was performed by excision of PAV from the 

plasmid with the Apa I and Sas II restriction endonucleases and co-transfection into 293 cells 
(an Ela/Elb expressing cell line) (Graham, F.L. et al, (1977) J. Gen Virol 36:59-74) with 
either wild-type Ad2, or packaging/replication deficient helper virus. Purification of PAV 
from helper can be accompanied by CsCl gradient isolation as PAV viral particles will be of a 
» 30 lower density and will band at a higher position in the gradient. 

For gene therapy, it is desirable to generate significant quantities of PAV virion free 
. from contaminating helper virus. The primary advantage of PAV over standard adenoviral 
vectors is the ability to package large DNA inserts into virion (up to about 36 kb). However, 
PAV requires a helper virus for replication and packaging and this helper virus v^U be the 

35 predominant species in any PAV preparation. To increase the proportion of PAV in viral 
preparation several approaches can be employed. For example, one can use a helper virus 
which is partially defective for packaging into virions (either by virtue of mutations in the 
packaging sequences (Grable, M. and Hearing P. (1992) J. Virol 66: 723-731)) or by virtue 
of its size -viruses with genome sizes greater than approximately 37.5 kb package 
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inefficiently. In mixed infections with packaging defective virus, PAV would be expected to 
be represented at higher levels in the virus mixture than would occur with non-packaging 
defective helper viruses. 

Another approach is to make the helper virus dependent upon PAV for its own 
S replication. This may most easily be accomplished by deleting an essential gene from the 
helper virus (e.g. IX or a terminal protein) and placing that gene in the PAV vector. In this 
way neither PAV nor the helper virus is capable of independent rq)lication - PAV and the 
helper virus are therefore co-dependent. This should result in higher PAV representation in 
the resulting virus preparation. 

10 A third approach is to develop a novel packaging cell line, which is capable of 

generating significant quantities of PAV virion free from contaminating helper virus. A 
novel protein DC, (pIX) packaging system has been developed. This system exploits several 
documented features of adenovirus molecular biology. The first is that adenoviral defective 
particles are known to comprise up to 30% or more of standard wild-type adenoviral 

1 5 preparations. These defective or incomplete particles are stable and contain 15-95% of the 
adenoviral genome, typically 15-30%. Packagmg of a PAV genome (15-30% of wild-type 
genome) should package comparably. Secondly, stable packaging of full-length Ad genome 
but not genomes <95% requked the presence of the adenoviral gene designated pDC. 
The novel packaging system is based on the generation of an Ad protein pIX 

20 expressing 293 cell line. In addition, an adenoviral helper virus engineered such that the El 
region is deleted but enough exogenous material is inserted to equal or slightly exceed the 
full length 36 kb size. Both of these two constructs would be introduced into the 293/pIX 
cell line as purified DNA. In the presence of pIX, yields of both predicted progeny viruses as 
seen in current PAV/Ad2 production experiments can be obtained. Virus containing lysates 

25 from these cells can then be titered independently (for the marker gene activity specific to 
either vector) and used to infect standard 293 (lacking pIX) at a multiplicity of infection of 1 
relative to PAV. Since research with this line as well as from incomplete or defective particle 
research indicates that full length genomes have a competitive packaging advantage, it is 
expected that infection with an MOI of 1 relative to PAV will necessarily equate to an 

30 effective MOI for helper of greater than 1 . All cells will presumably contain both PAV (at 
least 1) and helper (greater than 1). Replication and viral capsid production in this cell 
should occur normally but only PAV genomes should be packaged. Harvesting these 
293/pIX cultures is expected to yield essentially helper-free PAV. 

35 Example 12 - C onstruction of Ad2-E4/ORF 6 

Ad2-E4/ORF6 (Figure 33 shows the plasmid construction of Ad2-E4/ORF6) which is 
an adenovirus 2 based vector deleted for all Ad2 sequences between nucleotides 32815 and 
35577. This deletion removes all open reading frames of E4 but leaves the E4 promoter and 
first 32-37 nucleotides of the E4 mRNA intact. In place of the deleted sequences, a DNA 
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fragment encoding ORF6 (Ad2 nucleotides 34082-33178) which was derived by polymerase 
chain reaction of Ad2 DN A with 0RF6 specific DNA primers 

(Genzyme oligo. # 2371 - CGGATCCTTTATTATAGGGGAAGTCCACGCCTAC (SEQ. 
ID N0:8) and oligo. #2372 - CGGGATCCATCGATGAAATATGACTACGTCCG (SEQ. 
5 ID N0:9) were inserted). Additional sequences supplied by the oligonucleotides included a 
cloning site at the 5' and 3' ends of the PGR fragment (Clal and EamHl respectively) and a 
polyadenylation sequence at the 3* end to ensure correct polyadenylation of the 0RF6 
mRNA. As illustrated in Figure 33, the PGR fragment was first ligated to a DNA fragment 
including the inverted temunal repeat (ITR) and E4 promoter region of Ad2 (Ad2 nucleotides 

10 35937-35577) and cloned in the bacterial plasmid pBluescript (Stratagene) to create plasmid 
0RF6. After sequencing to verify the integrity of the 0RF6 reading frame, the fragment 
encompassing the ITR and 0RF6 was subcloned into a second plasmid, pAd A E4, which 
contains the 3' end of Ad2 from a Sac I site to the 3* ITR (Ad2 nucleotides 28562-35937) and 
is deleted for all E4 sequences (promoter to poly A site Ad2 positions 3281 5-35641) using 

15 flanking restriction sites. In this second plasmid, virus expressing only E4 0RF6, pAdORF6 
was cut with restriction enzyme Pad and ligated to Ad2 DNA digested with Pad . This Pad 
site corresponds to Ad2 nucleotide 28612. 293 cells were transfected with the ligation and 
the resulting virus was subjected to restriction analysis to verify that the Ad2 E4 region had 
been substituted with the conresponding region of pAdORF6 and that the only remaining E4 

20 open reading frame was 0RF6. 

A cell line could in theory be established that would fully complement E4 functions 
deleted from a recombinant virus. The .problem with this approach is that E4 functions in the 
regulation of host cell protein synthesis and is therefore toxic to cells. The present 
recombinant adenoviruses are deleted for the El region and must be grown in 293 cells which 

25 complement El functions. The E4 promoter is activated by the Ela gene product, and 

therefore to prevent inadvertent toxic expression of E4 transcription of E4 must be tightly 
regulated. The requirements of such a promoter or transactivating system is that in the 
uninduced state expression must be low enough to avoid toxicity to the host cell, but in the 
induced state must be sufficiently activated to make enough E4 gene product to complement 

30 the E4 deleted virus during virus production. 

p?i;9mple 13 

An adenoviral vector is prepared as described in Example 7 while substituting the 
phosphoglycerate kinase (PGK) promoter for the Ela promoter. 

35 

Example 14 

An adenoviral vector is prepared as described in Example 1 1 while substituting the 
PGK promoter for the Ad2 major late promoter (MLP). 
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Example 15: Generation of AdS-ORFfi/PGK-CFTR 

This protocol uses a second generation adenovirus vector named Ad2-ORF6/PGK- 
CFTR. This virus lacks El and in its place contains a modified transcription unit with the 
PGK promoter and a poly A addition site flanking the CFTR cDNA. The PGK promoter is 
5 of only moderate strength but is long lasting and not subject to shut off. The E4 region of the 
vector has also been modified in that the whole coding sequence has been removed and 
replaced by 0RF6, the only E4 gene essential for growth of Ad in tissue culture. This has the 
effect of generating a genome of 101% the size of wild type Ad2. 

The DNA construct comprises a full length copy of the Ad2 genome from which the 
1 0 early region 1 (El) genes (present at the S' end of the viral genome) have been deleted and 
replaced by an expression cassette encodmg CFTR. The expression cassette includes the 
promoter for phosphoglycerate kinase (PGK) and a polyadenylation (poly A) addition signal 
from the bovine growth hormone gene (BGH). In addition, the E4 region of Ad2 has been 
deleted and replaced with only open reading frame 6 (0RF6) of the Ad2 E4 region. The 
1 5 adenovirus vector is referred to as AD2-ORF6/PGK-CFTR and is illustrated schematically in 
Figure 34. The entire wild-type Ad2 genome has been previously sequenced (Roberts, R.J., 
(1986) In Adenovirus DNA, W. Oberfler, editor, Matinus Nihoff Publishing, Boston) and the 
existing numbering system has been adopted here when referring to the wild type genome. 
Ad2 genomic regions flanking El and E4 deletions, and insertions into the genome are being 
20 completely sequenced. 

The Ad2-ORF6/PGK-CFTR construct differs from the one used in our earlier 
protocol (Ad2/CFTR-1) in that the latter utilized the endogenous Ela promoter, had no poly 
A addition signal directly downstream of CFTR and retained an intact E4 region. The 
properties of Ad2/CFTR-1 in tissue culture and in animal studies have been reported (Rich et 
25 al,, (1993) Human Gene Therapy 4:461-467; and Zabner et al. (1993) Nature Genetics (m 
Press), 

At the S' end of the genome, nucleotides 357 to 3328 of Ad2 have been deleted and 
replaced with (in order 5' to 3') 22 nucleotides of linker, 534 nucleotides of the PGK 
promoter, 86 nucleotides of linker, nucleotides 123-4622 of the published CFTR sequence 

30 (Riordan et al. (1989) Science 245:1066-1073), 21 nucleotides of linker, and a 32 nucleotide 
synthetic BGH poly A addition signal followed by a fmal 1 1 nucleotides of linker. The 
topology of the 5' end of the recombinant molecule is illustrated in Figure 34. 

At the 3* end of the genome of Ad2-ORF6/PGK-CFTR, Ad2 sequences between 
nucleotides 32815 and 35577 have been deleted to remove all open reading frames of E4 but 

35 retain the E4 promoter, the E4 cap sites and first 32-37 nucleotides of E4 mRNA. The 
deleted sequences were replaced with a fragment derived by PGR which contains open 
reading frame 6 of Ad2 (nucleotides 34082-33178) and a synthetic poly A addition signal. 
The topology of the 3' end of the molecule is shown in Figure 34. The sequence of this 
segment of the molecule will be confirmed. The remainder of the Ad2 viral DNA sequence is 
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published in Roberts, RJ. in Adenovirus DNA. (W. Oberfler, Matinus Nihofif Publishing, 
Boston, 1986 ). The overall size of the Ad2-ORF6/PGK-CFTR vector is 36,336 bp which is 
101.3% of full length Ad2. See Table HI for the sequence of Ad2-ORF6/PGK-CFTR. 
The CFTR transcript is predicted to initiate at one of three closely spaced 
5 transcriptional start sites in the cloned PGK promoter (Singer-Sam et al. (1984) Gene 32:409- 
417) at nucleotides 828, 829 and 837 of the recombinant vector (Smger-Sam et al. (1984) 
Gene 32:409-417). A hybrid 5' untranslated region is comprised of 72, 80 or 81 nucleotides 
of PGK promoter region, 86 nucleotide of linker sequence, and 10 nucleotides derived from 
the CFTR insert. Transcriptional termination is expected to be directed by the BGH poly A 

1 0 addition signal at recombinant vector nucleotide 5530 yielding an approximately 4.7 kb 
transcript. The CFTR coding region comprises nucleotides 1010-5454 of the recombinant 
virus and nucleotides 182, 181 or 173 to 4624, 4623, or 4615 of the PGK-CFTR-BGH 
mRNA respectively, depending on which transcriptional initiation site is used. Within the 
CFTR cDNA there are two differences from the published (Riordan et al, cited supra) cDNA 

15 sequence. An A to C change at position 1990 of the CFTR cDNA (published CFTR cDNA 
coordinates) which was an error in the original published sequence, and a T to C change 
introduced at position 936. The change at position 936 is translationally silent but increases 
the stability of the cDNA when propagated in bacterial plasmids (Gregory et al. (1990) 
Nature 347:382-386; and Cheng et al. (1990) Cell 63:827-834). The 3' untranslated region of 

20 the predicted CFTR transcript comprises 21 nucleotides of linker sequence and 
approximately 10 nucleotides of synthetic BGH poly A additional signal. 

Although the activity of CFTR can be measured by electrophysiological methods, it is 
relatively difficult to detect biochemically or immunocytochemically, particularly at low 
levels of expression (Gregory et al., cited supra\ and Denning et al. (1992) J. Cell Biol 

25 1 1 8:55 1 -559). A high expression level reporter gene encoding the E. coli p galactosidase 
protein fused to a nuclear localization signal derived from the SV40 T-antigen was therefore 
constructed. Reporter gene transcription is driven by the powerful CMV early gene 
constitutive promoter. Specifically, the El region of wild type Ad2 between nucleotides 357- 
3498 has been deleted and replaced it with a 515 bp fragment containing the CMV promoter 

30 and a 3252 bp fragment encoding the p galactosidase gene. 



Regulatory Characteristics of the Elements of the AD2>ORF6/PGK-CFTR 

In general terms, the vector is similar to several earlier adenovirus vectors encoding 
CFTR but it differs in three specific ways from the Ad2/CFTR-1 construct. 

PGK Promoter 

Transcription of CFTR is from the PGK promoter. This is a promoter of only 
moderate strength but because it is a so-called house keeping promoter we considered it more 
likely to be capable of long term albeit perhaps low level expression. It may also be less 
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likely to be subject to "shut-down" than some of the very strong promoters used in other 
studies especially with retroviruses. Since CFTR is not an abundant protein longevity of 
expression is probably more critical than high level expression. Expression from the PGK 
promoter in a retrovirus vector has been shown to be long lasting (Apperley et al. (1991) 
5 j5W78:310.317). 

Polyadcnylation Siignal 

Ad2-ORG6/PGK-CFTR contams an exogenous poly A addition signal after the CFTR 
coding region and prior to the protein IX coding sequence of the Ad2 El region. Since 

1 0 protein is believed to be involved in packaging of virions, this coding region was retained. 
Furthermore, since protein IX is synthesized from a separate transcript with its own promoter, 
to prevent possible promoter occlusion at the protein DC promoter, the BGH poly A addition 
signal was inserted. There is indirect evidence that promoter occlusion can be problematic m 
that Ad2/CMV pGal grows to lower viral titers on 293 cells than does Ad2/pgal-l . These 

1 5 constructs are identical except for the promoter used for p galactosidase expression. Since 
the CMV promoter is much stronger than the El a promoter it is probable that abundant 
transcription from the CMV promoter through the p galactosidase DNA into the protein IX 
coding region reduces expression of protein IX from its own promoter by promoter occlusion 
and that this is responsible for the lower titer of Ad2/CMV-Pgal obtained. 

20 . 

Alterations of the E4 Region 

A large portion of the E4 region of the Ad2 genome has been deleted for two reasons. 
The first reason is to decrease the size of the vector used or expression of CFTR. Adenovirus 
vectors with genomes much larger than wild type are packaged less efBciently and are 

25 therefore difficuh to grow to high titer. The combination of the deletions in the El and E4 
regions in Ad2-ORF6/PGK-CFTR reduce the genome size to 101% of wild type. In practice 
it is straightforward to prepare high titer lots of this virus. 

The second reason to remove. E4 sequences relates to the safety of adenovirus vectors. 
A goal of these studies is to remove as many viral genes as possible to inactive the Ad2 virus 

30 backbone m as many ways as possible. The OF 6/7 gene of the E4 region encodes a protein 
that is involved in activation of the cellular transcription factor E2-F which is in turn 
unplicated in the activation of the E2 region of adenovirus (Hemstrom et al. (1991) J. ViroL 
65:1440-1449). Therefore removal of ORF6/7 from adenovirus vectors may provide a further 
margin of safety at least when grown in non-proliferating cells. The removal of the El region 

35 already renders such vectors disabled, in part because El a, if present, is able to displace E2-F 
from the retinoblastoma gene product, thereby also contributing to the stimulation of E2 
transcription. The 0RF6 reading frame of Ad2 was added back to the E1-E4 backbone of the 
Ad2-ORF6/PGK-CFTR vector because ORF6 function is essential for production of the 
recombinant virus in 293 cells. 0RF6 is believed to be involved in DNA replication, host 
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cell shut off and late xnRNA accumulation in the normal adenovirus life cycle. The E1-E4- 
ORF6"*" backbone Ad2 vector does replicate in 293 cells. 

The promoter/enhancer use to drive transcription of 0RF6 of E4 is the endogenous E4 
promoter. This promoter requires El a for activation and contains El a core enhancet 
5 elements and SPl transcription factor binding sites (reviewed in Berk, AJ. (1986) Ann. Rev. 
Genet. 20:75-79). 

RcpUcation Origin 

The only replication origins present in Ad2-ORF6/PGK-CFTR are those present in 
1 0 the Ad2 parent genome. Replication of Ad2-ORF6/PGK-CFTR sequences has not been 
detected except when complemented with wild type El activity. 

Steps Ussd to Pcrive the DNA ConstroPt 

Construction of the recombmant Ad2-ORF6/PGK-CFTR virus was accomplished by 
1 5 in vivo recombmation of Ad2-ORF6 DNA and a plasmid containing the 5* 1 0.7 kb of 
adenovirus engineered to have an expression cassette encoding the human CFTR cDNA 
driven by the PGK promoter and a BGH poly A signal in place of the El coding region. 

The generation of the plasmid, pBRAd2/PGK-CFTR is described here. The starting 
plasmid contains an approximately 7.5 kb insert cloned into the £M and BamHI sites of 
20 pBR322 and comprises the first 1 0,680 nucleotides of Ad2 with a deletion of the Ad2 
sequences between nucleotides 356 and 3328. This plasmid contains a CMV promoter 
inserted into the £M and Spel sites at the region of the El deletion and is designated 
pBRAd2/CMV. The plasmid also contains the Ad2 5' ITR, packaging and replication 
sequences and El enhancer. The El promoter, Ela and most of Elb coding region has been 
25 deleted. The 3' terminal portion of the El b coding region coincides with the pIX promoter 
which was retained. The CMV promoter was removed and replaced with the PGK promoter 
as a CM and Spel fi-agment from the plasmid PGK-GCR. The resulting plasmid, 
pBRAd2/PGK, was digested with Avrll and BstBI and the excised fragment replaced with the 
Spel to BstBI fragment from the plasmid construct pAd2Ela/CFTR. This transferred a 
' 30 fragment containing the CFTR cDNA, BGH poly A signal and the Ad2 genomic sequences 
from 3327 to 10,670. The resulting plasmid is designated pBRAd2/PGK-CFTR. The CFTR 
cDNA fragment was originally derived from the plasmid pCMV-CFTR-936C using 
restriction enzymes Sfifil and Ecll36IL pCMV.CFTR-936C consists of a minimal CFTR 
cDNA encompassing nucleotides 123-4622 of the published CFTR sequence cloned into the 
35 multiple cloning site of pRC/CMV (Invitrogen Corp.) using synthetic linkers. The CFTR 
cDNA within this plasmid has been completely sequenced. 

The Ad2 backbone virus with the E4 region that expresses only open reading frame 6 
was constructed as follows. A DNA fragment encoding 0RF6 (Ad2 nucleotides 34082- 
33 1 78) was derived by PCR with 0RF6 specific DNA primers. Additional sequences 
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supplied by the oligonucleotides include cloning sites at the 5' and 3' ends of the PGR 
fragment. (Clal and BamHI respectively) and a poly A addition sequence AATAAA at the 3* 
end to ensure correct polyadenylation of 0RF6 mRNA, The PGR fragment was cloned into 
pBluescript (Stratagene) along with an Ad2 fragment (nucleotides 35937-35577) containing 
5 the inverted terminal repeat, E4 promoter, E4 mRNA cap sites and first 32-37 nucleotides of 
E4 mRNA to create pORF6. A S^-EfflnHI fragment encompassing the ITR and 0RF6 was 
used to replace the Sall-BamHI fragment encompassing the ITR and E4 deletion in pAdAE4 
contains the 3* end of Ad2 from a Spel site to the 3' ITR (nucleotides 27123-35937) and is 
deleted for all E4 sequences including the promoter and poly A signal (nucleotides 32815- 
10 35641). The resulting construct, pAdE40RF6 was cut with Eael and ligated to Ad2 DNA 
digested with Pad nucleotide 28612). 293 cells were transfected with the ligation reaction to 
generate virus containing only open reading frame 6 fit)m the E4 region. 

In Vitro Studies w ith Ad2-ORF6/PGK-CFTR 

15 The ability of Ad2-ORF6/PGK-GFTR to express GFTR in several cell lines, including 

human HeLa cells, human 293 cells, and primary cultures of nomial and GF human airway 
epithelia was tested. As an example, the results from the human 293 cells is related here. 
When himian 293 cells were grown on culture dishes, the vector was able to transfer GFTR 
cDNA and express GFTR as assessed by immunoprecipitation and by frmctional assays of 

20 halide efflux. Gregory, R.J. et al. (1990) Nature 347:382-386; Gheng, S.H. et al. (1990) Cell 
63:827-834. More specifically, procedures for preparing cell ly sates, immunoprecipitation of . 
proteins using anti-GFTR antibodies, one-dimensional peptide analysis and SDS- 
polyacrylamide gel electrophoresis were as described by Gheng et al. Gheng, S.H. et al. 
(1990) Cell 63:827-834. Halide efflux assays were performed as described by Gheng, S.H. et 

25 al. (1991) Cell 66:1027-1036. cAMP-stunulated GFTR chloride channel activity was 

measured using the halide sensitive fluorophore SPQ in 293 cells treated with 500 lU/cell 
Ad2-ORF6/PGK-GFTR. Stimulation of the mfected cells with forskolin (20 ^iM) and IBMX 
(100 |im) increased SPQ fluorescence indicating the presence of functional chloride channels 
produced by the vector. 

30 Additional studies using primary cultures of human afrway (nasal polyp) epithelial 

cells (from GF patients) infected with Ad2-ORF6/PGK-GFTR demonstrated that Ad2- 
0RF6/PGK-GFTR infection of the nasal polyp epithelial cells resulted in the expression of 
cAMP dependent Gl" channels. Figure 35 is an example of the results obtained from such 
studies. Primary cultures of GF nasal polyp epithelial cells were infected with Ad2- 

35 0RF6/PGK-GFTR at multiplicities of 0.3, 3, and 50. Three days post infection, monlayers 
were moimted in Ussing chambers and short-circuit current was measured. At the indicated 
times: (1) 10 amiloride, (2) cAMP agonists (10 fiM forskolin and 100 ^M IBMX), and 
(3) 1 mM diphenylamine-2-carboxylate were added to the mucosal solution. 
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In ViivQ Stwdks with Ad2-O RF6 /P GK - C FTR 
Yim preparation 

Two preparations of Ad2-ORF6/PGK-CFTR virus were used in this study. Both were 
5 prepared at Genzyme Corporation, in a Research Laboratory. The preparations were purified 
on a CsCl gradient and then dialyzed against tris-buffered saline to remove the CsCl. The 
preparation for the first administration (lot #2) had a titer of 2 x 1 0^ 0 nj/ml. The preparation 
for the second administration (lot #6) had a titer of 4 x 10^^ lU/ml. 

10 Aqmals 

Three female Rhesus monkeys, Macaca mulatto, were used for this study. Monkey C 
(#20046) weighed 6.4 kg. Monkey D (#20047) weighed 6.25 kg. Monkey E (#20048) 
weighed 10 kg. The monkeys were housed in the University of Iowa at least 360 days before 
the start of the study. The animals were maintained with free access to food and water 

1 5 throughout the study. The animals were part of a safety study and efficacy study for a 
different viral vector (Ad2/CFTR-1) and they were exposed to 3 nasal viral instillation 
throughout the year. The previous mstillation of Ad2/CFTR-1 was performed 116 days prior 
to the initiation of this study. All three Rhesus monkeys had an anti-adenoviral antibody 
response as detected by ELISA after each viral instillation. There are no known contaminants 

20 that are expected to interfere with the outcome of this study. Fluorescent lighting was 
controlled to automatically provide alternate light/dark cycles of approximately 12 hours 
each. The monkeys were housed in an isolation room in separate cages. Strict respiratory 
and body fluid isolation precautions were taken. 

25 Virus administration 

For application of the virus, the monkeys were anesthetized by intramuscular injection 
of ketamine (15 mg/kg). The entire epithelium of one nasal cavity in each monkey was used 
for this study. A foley catheter (size 10) was inserted through each nasal cavity into the 
pharynx, the balloon was inflated with a 2-3 ml of air, and then pulled anteriorly to obtain a 

30 tight occlusion at the posterior choana. The Ad2-ORF6/PGK-CFTR virus was then instilled 
slowly into the right nostril with the posterior balloon inflated. The viral solution remained 
in contact with the nasal mucosa for 30 min. The balloons were deflated, the catheters were 
removed, and the monkeys were allowed to recover from anesthesia. 

On the first administration, the viral preparation had a titer of 2 x 10^® lU/ml and 

35 each monkey received approximately 0.3 ml. Thus the total dose applied to each monkey 
was approximately 6.5 x 10^ lU. This total dose is approximately half the highest dose 
proposed for the himian study. When considered on a lU/kg basis, a 6 kg monkey received a 
dose approximately 3 times greater that the highest proposed dose for a 60 kg human. 
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Tinning Qf evaluation?. 

The animals were evaluated on the day of administration, and on days 3, 7, 24, 38, 
and 44 days after infection. The second administration of virus occurred on day 44. The 
mionkeys were evaluated on day 48 and then on days 55, 62, and 129. 
5 For evaluations, monkeys were anesthetized by intramuscular injection of ketamine 

(1 5 mg/kg). To obtain nasal epithelial cells after the first viral administration, the nasal 
mucosa was first impregnated with 5 drops of Afiin (0.05% oxymetazoline hydrochloride, 
Schering-Plough) and 1 ml of 2% Lidocaine for 5 minutes. A cytobrush was then used to 
gently rub the mucosa for about 3 sec. To obtain pharyngeal epithelial swabs, a cotton-tipped 
1 0 applicator was rubbed over the back of the pharynx 2-3 times. The resulting cells were 
dislodged from brushes or applicators into 2 ml of sterile PBS. After the second 
administration of Ad2-ORF6/PGK-CFTR, the monkeys were followed clmically for 3 weeks, 
and mucosal biopsies were obtained from the monkeys medial turbinate at days 4, 1 1 and 1 8. 

15 Animal evaluation. 

Animals were evaluated daily for evidence of abnormal behavior of physical signs. A 
record of food and fluid intake was used to assess appetite and general health. Stool 
consistency was also recorded to check for the possibility of diarrhea. At each of the 
evaluation time points, rectal temperature, respiratory rate, and heart rate were measured. 

20 The nasal mucosa, conjuctivas and pharynx were visually inspected. The monkeys were also 
exanuned for lymphadenopathy. 

Hematologv and serum chemistrv 

Venous blood from the monkeys was collected by standard veniptmcture technique. 
25 Blood/serum analysis was performed in the clinical laboratory of the University of Iowa 
Hospitals and Clinics using a Hitatchi 737 automated chemistry analyzer and a Technicom 
H6 automated hematology analyzer. 

Serology 

30 Sera from the monkeys were obtained and anti-adenoviral antibody titers were 

measured by ELISA. For the ELISA, 50 ng/well of killed adenovirus (Lee Biomolecular 
Research Laboratories, San Diego, Ca) was coated in O.IM NaHCOs at 4^ C overnight on 96 
well plates. The test samples at appropriate dilutions were added, starting at a dilution of 
1/50, The samples were incubated for 1 hour, the plates washed, and a goat anti-human IgG 

35 HRP conjugate (Jackson ImmunoResearch Laboratories, West Grove, PA) was added for 1 
hour. The plates were washed and O-Phenylenediamine (OPD) (Sigma Chemical Co., St. 
Louis, MO) was added for 30 min. at room temperature. The assay was stopped with 4.5 M 
H2SO4 and read at 490 nm on a Molecular Devises microplate reader. The titer was 
calculated as the product of the reciprocal of the initial dilution and the reciprocal of the 
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dilution in the last well with an OD>0.1 00. Nasal, washings from the monkeys were obtained 
and anti-adenoviral antibody titers were measured by ELISA, starting at a dilution of 1/4. 

N9S&1 Washings, 

5 Nasal washings were obtained to test for the possibility of secretory antibodies that 

could act as neutralizing antibodies. Three ml of sterile PBS was slowly instilled into the 
nasal cavity of the monkeys, the fluid was collected by gravity. The washings were 
centrifuged at 1 000 RFM for 5 minutes and the supernatant was used for anti-adenoviral, and 
neutralizing antibody measurement 

10 

Cvtology 

Cells were obtained from the monkey's nasal epithelium by gently rubbmg the nasal 
mucosa for about 3 seconds with a cytobrush. The resulting cells were dislodged frY)m the 
brushes into 2 ml of PBS. The cell suspension was spun at 5000 ipm for 5 min, and 
1 5 resuspended in 293 media at a concentration of 1 0^ cells/ml. Forty ^1 of the cell suspension 
was placed on slides using a Cytospin. Cytospin slides were stained with Wright's stain and 
analyzed for cell differential using light microscopy. 

Culture for Ad2-ORF6/PFK-CFTR 

20 To assess for the presence of infectious viral particles, the supernatant from the nasal 

brushings and pharyngeal swabs of the monkeys were used. Twenty-five |iil of the 
supernatant was added in duplicate to 293 cells. 293 cells were used at 50% confluence and 
were seeded in 96 well plates. 293 cells were incubated for 72 hours at 37^C, then fixed with 
a mixture of equal parts of methanol and acetone for 10 min and incubated with an FITC 

25 label anti-adenovirus monoclonal antibodies (Chemicon, Light Diagnostics, Temecuca, Ca) 
for 30 min. Positive nuclear immunofluorescence .was interpreted as positive culture. 

ImmTOQpytQghemiistry for thg dctgction of CFTRt 

Cells were obtained by brushing. Eighty ^1 of cell suspension were spun onto gelatin- 

30 coated slides. The slides were allowed to air dry, and then fixed with 4% paraformaldehyde. 
The cells were permeabilized with 0.2 Triton-X (Pierce, Rockford, II) and then blocked for 60 
minutes with 5% goat serum (Sigma, Mo). A pool of monoclonal antibodies (Ml 3-1, Ml -4, 
and M6-4) (Gregory et al., (1990) Nature 347:382-386); Denning et al„ (1 992) J. Cell Biol 
118:(3) 551-559); Denning et al., (1992) Nature 358:761-764) were added and incubated for 

35 12 hours. The primary antibody was washed off and an antimouse biotinylated antibody 

(Biomeda, Foster City, Ca) was added. After washing, the secondary antibody, streptavidin 
FITC (Biomeda, Foster City, Ca) was added and the slides were observed with a laser 
scanning confocal microscope. 
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To assess for histologic evidence of safety, nasal medial turbinate biopsies were 
obtained on day 4, 1 1 and 18 after the second viral administration as described before 
(Zabner et al (1993) Human Gene Therapy, in press). Nasal biopsies were fixed in 4% 
5 formaldehyde and H&E stained sections were reviewed. 

RESULTS 

Studies of efiRcacy. 

1 0 To directly assess the presence of CFTR, cells obtained by brushing were plated onto 

slides by cytospin and stained with antibodies to CFTR. A positive reaction is clearly evident 
m cells exposed to Ad2-ORF6/PGK-CFTR. The cells were scored as positive by 
inmiunocytochemistry when evaluated by a reader blinded to the identity of the samples. 
Cells obtamed prior to infection and fi^om other untreated monkeys were used as negative 

1 5 controls. Figures 36A-36D, 37A-37D, and 38A-38D show examples fi-om each monkey. 



Studies of safety 

None of the monkeys developed any clinical signs of viral infections or inflammation. 

20 There were no visible abnormalities at days 3, 4, 7 or on weekly inspection thereafter. 

Physical examination revealed no fever, lymphadenopathy, conjunctivitis, coryza, tachypnea, 
or tachycardia at any of the time points. There was no cough, sneezing or diarrhea. The 
monkeys had no fever. Appetites and weights were not affected by virus administration in 
either monkey. The data are summarized in Figures 39A-39C. 

25 The presence of live virus was tested in the supematant of cell suspensions firom 

swabs and brushes firom each nostril and the phaiynx. Each supematant was used to infect 
the virus-sensitive 293 cell line. Live virus was never detected at any of the tune points. The 
rapid loss of live virus suggests that there was no viral replication. 

The results of complete blood counts, sedimentation rate, and clinical chemistries are 

30 shown in Figure 40A-40C. There was no evidence of a systemic itiQammatory response or 
other abnormalities of the clinical chemistries. 

Epithelial inflammation was assessed by cytological examination of Wright-stained 
cells (cytospin) obtained from brushings of the nasal epithelium. The percentage of 
neutrophils and lymphocytes from the infected nostrils were compared to those of the control 

35 nostrils and values from four control monkeys. Wright stains of cells from nasal brushing 
were performed on each of the evaluation days. Neutrophils and lymphocytes accounted for 
less than 5% of total cells at all time points. The data are shown in Figure 41 . The data 
indicate that administration of Ad2-ORF6/PGK-CFTR caused no change in the distribution 
or number of inflammatory cells at any of the time points following virus administration, 
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even during a second administration of the virus. The biopsy slides obtained after the second 
Ad2-ORF6/PGK-CFTR administration were reviewed by an independent pathologist, who 
found no evidence of inflanmiation or any other cytopathic effects. Figures 42 to 44 show an 
example from each monkey. 
5 Figures 45A-45C shows that all three monkeys had developed antibody titers to 

adenovirus prior to the first infection with Ad2-ORF6/PGK-CFTR (Zabner et al. (1993) 
Human Gene Therapy (in press)). Antibody titers measured by ELIS A rose within one week 
after the first and second administration and peaked at day 24. No anti-adenoviral antibodies 
were detected by ELISA or neutralizing assay in nasal washings of any of the monkeys. 

1 0 These results combined with demonstrate the ability of a recombinant adenovirus 

encoding CFTR (Ad2-ORF6/PGK-CFTR) to express CFTR cDNA in the airway epithelium 
of monkeys. These monkeys have been followed clinically for 12 months after the first viral 
administration and no complications have been observed. 

The results of the safety studies are encouraging. No evidence of viral replication was 

15 found; infectious viral particles were rapidly cleared. The other major consideratipn for 
safety of an adenovirus vector in the treatment of CF is the possibility of an inflammatory 
response. The data indicate that the virus generated an antibody response, but despite this, no 
evidence of a systemic or local inflammatory response was observed. The cells obtained by 
brushings and swabs were not altered by virus application. Since these Monkeys had been 

20 previously exposed three times to Ad2/CFTR-1, these data suggest that at least five 
sequential exposures of airway epithelium to adenovirus does not cause a detrimental 
inflanmiatory response. 

These data indicate that Ad2-ORF6/PGK-CFTR can effectively transfer CFTR cDNA 
to airway epithelium and direct the expression of CFTR. They also indicate that transfer and 

25 expression is safe in primates. 

Eqwiivplents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents of the specific embodiments of the invention 
. 30 described herein. Such equivalents are intended to be encompassed by the following claims. 
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CYSTIC FfBROSIS TR;'J^S1-£EHBRA1>IE COKTDUCT^^J^^CE REGULATOR; CODDN > 

h " HYBRID ElA-CfTR-ElB MESSAGE h > 

860i 123 TO 4622 OF HUMAN CFTR CDIV. 900i 9i0> 



1390 1400 1410 K20 1430 144 



TC^TTACCTC ACWATCXTT CfJJMX tCC iJTCTCTTAJ gCJTJCTg TgGAAGAAg 
ACTAXTGGAC TCTTTACTAX CnTTOTACC TTAGACWlTT CCCTATCXOS ACCu^iw^ 
V I TS EMI ENI QSVK A. Y C WEE> 

afsrrr FIBROSIS TRANSWEKBRWJE CCa«»CTANCE REXHJIATOR; CODO^ > 

h H ySRID EIA-CTTR-EIB M ESSA GE J \ — 

sao i 12 3 TO 4622 OF HUMAN CFTR CDNA 960a«, 970> 

.1450 1460 1470 1480 1490 1500 

CAATGGAAAA AA«MTCAA AACTTAAGAC AAACAGAACT GAAACTGACT CCGAAGGCAG 

cttaccttit.toIcSa^ JJaSncixs TrrcrcTTGA ctttgactcsa gcctocgtc 

AMEK MIE HLR QTEL .KLT RKA> 

_GYSTIC FIBROSIS TRMQSMEMBRMIB CONEOCTANCE REGOIATOR; CODO^ > 

h ^KVBRID EIA-CFTR-EIB MESSAGE W > 

980 i 12 3 TO 4622 OP HW«N CFTR CCWL— Jl020i :1030> 



1510 1520 1530 1540 1550 1560 

CCTATbTCAG ATACTTCAAT AGCTCAGCCT TCTTCTTCTC f^^^^''^^;^ .^^^^^Tn? 
GGATACACTC TATGAAG7TA TCGAGTCGGA AGAAGAAGMJ TCCCAAGAAA CACCACAAAA 
A YVR YFN SSA FFFS O FF VVF> 

CYSTIC FIBROSIS TRMJSMEMBRMJE CONDUCTANCE REGOIATOR; CODON > 

h HYBRID ElA-CFTR-ElB MESSAGE. n > 

I 040 i 12 3 TO 4622 OF HDMM9 CFIR CEMA_1080a ^1090> 

1570 1580 1590 1600 1610 1620 

TATCTGTCCT TCCCTATCCA CTAATCAAAG GAATCATCCT CCGGJ^TA TTCJCGJlCC^ 
ATAGACACGA AGGGATACGT GATXAGTTTC CTTAGTAGGA GGCCmTAT AAGTGGTGGT 
L SVL PYA LIK GIIL RKI FTT> 

_CYSTIC FIBROSIS TRM4SKEMBRM3E CCaTOOCTANCE REGOIATOR; CODOM > 

HYBRID ElA-CFTR-ElB MESSAGE > 

123 TO 4622 OF HUKW CFTR CCN?* 1140a 1150> 



-llOOi. 



1630 1640 1650 1660 1670 1680 

TCTCA1TCTG CATTGrrCTG CGCATGGCGG TCACTCGGCA ATTTCCCTGG GCTGTACAAA 
AGAGTAAGAC GTAACAAGAC GCGTACCGCC AGTGAGCCGT TAAAGGGACC CGACATGTTT 
ISFC IVL RMA VTRO FPW AVO> 

CYSTIC FIBROSIS THANSMSSRP^NE CONDUCTANCE REGOIATOR; CODCa-} > 

h HYBRID S1A-C?TR-E13 MESSAGE > 

lieOi 123 TO 4622 OF KtB>»N CFTR CDNA 1200i 1210> 

1690 1700 1"10 1720 1730 1740 

C-.TC-aTATG^. C^CTCTTC-a-. C---.-.T.-.--^C-. .-.-.-.7.-.C^0:L\ TrTCTT^C;^. AAGCX^GAAT 
GTACC;'.TACT GAGAGAACCT CG7TATTTGT TTTATGTCCT AAAGAATGTT TTCGTTCpA 
7 v: Y D S L G A I Iv K I Q D F L Q K Q i> 

CYSTIC FIBROSIS TjV-Jv'SKSSRAie CONDJCTW^CE P^UIATOR; CODDN > 

h HV3RID ELJ-.-CFTP.-riS MESSAGE h > 

1220i 123 TO 4 622 Or KU>1A1'3 CFTR CV.^ik 1260i 12.^ 



1750 1750 1770 1780 1790 1800 

w 

w 

1~A>.GSC;^TT GOAJ^.TATA-i.C TTA.f.Ca^.CTA CP.G^AGTAGT GATGGAG-.^.T GTA^.CAGCCT 
7ATTCTGT;iJ'. CCTTATATTG A.i.TTGCTG.\T GTCTTCP^.TCA CTACCTCTTA C^.TTGTCG-O.-. 
^ K T L £ Y N L T T T E V V KEN' V T Aj> 

CYSTIC FIBROSIS TKAJ'JSMEMBRAJvE: CO^rDJCTANCE PSCUU-.TO?.: CODDK' > 

'■ '" h HYBRID El^-.-CFTR-ElB KESSAGE h > 

1280i 123 TO 4622 OF KUK--»v CFTR CI^. 1320i 13:- C> 

1810 1820 lc30 1840 1850 ISiZ 
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TCTC«y.Ga^ ooGXTrrGOG Gw^rrArrns acaaaccaxa iJSSSSJ: 

AGACCCTCCT CCCTAWiCCC CTTAATAMlC TCTTrCCTTT ^^TTITCnTA TTGITOIOT 

U^CTii'^PiBSos^s ?rmJme«bi4e ^c«£c|S^ rego iator; CX>D0N > 

i a40 < 1 2a TO 4622 OF HOttN CFTR CDMA 13B0> X3SQ> 

1870 1880 1890 WOO ' 1910 1920 

AWICTTCTAA TG^TGKC AGCCKmCT ^JAJJJ ^^^^1^^ ^^SS^Jg 
inxyAGATT ACCACTACTG.TCGGAQAGA ACTCACTWA GAGTGWlWA CCATCAG^C 

^ ' cYslic'riBWJSis trmJmeSbiw«: (Xmdoctance rbcuiatori casm 

h w yBRID EIA-CFTR-EIB MESSAGE __n 

140Q i 1L2 3 TO 4622 OF HUMAN CFTR CBMA 1440a 1450a. 

.,1930 1940 1950 I960 1970 - 19 80 

TCCTCAAAGA TATTAATTTO AACyiTAGwI GAGGACAGTT GTTGGOGGTT ^J^^ 
AGGACrriCT ATAATTAAAC TTCTATCTTr CTCCTGTCAA CAACCGCCAA CGAOTAGGT 
V'LKD I MF KIE RGQI* lav AGS>- 

CYSTIC FIBROSIS TOANSMEHBRAME CONEOCXANCE RBGOIATOR? CODQN > 

h H VBRID EIA-CFIR-EIB MESSAGE- ' P rrr;^ 

1 46q V 12 3 TO 4622 OF HDMAN CFTR CENA ^ISOOi : ^1510> 

1990 2000 2010 2020 2030 2040 

CTCGAGCAGG CAAGACTTCI CTTCTAXTGA ^ATTAT^ JS^SSS? SSSSS 
GACCTCGTCC GTTCTGAAGT GAAGATTACT ACTAATACCC TCTTGACCTC GGAAGTCTCC 
Tri»f5 KTS LLMMIMG ELE P.SE> 

^S^IC Fl^SIS TRANSMEHBRANE CONDUCTANCE RSGUIATOR; CODON > 

' h H VBRID ElA-CFTR-ElB MESSAGE —J} 

l S20 i 123 TO 4622 OF HUMAN CFTR CDNA 1560a 1570> 

2050 2 060 2070 2080 2090 21 00 

GTAAAA.TrA;^ GO.CAGTGGA AGA^^.TTTCAT TCTGTTCTCA GyT^SIS J^^SSSc 
ttTTTTAATT CGTGTCACCT TCTTAAAGTA AGACAAaAGT CA^G^ACC T.j.^TACG^C 

^ CYSTIC^-IBROSIS TRANSMSMBRAIn^ COl^CTANCS REGUIATOR; CODON > 

" L ^yv3»TO ElA-CrTP.-ZlB MESSAGE ti > 

Q< 123' TO 4622 or HUHy^ CFTR CIX^A. 1620i i630> 



ISSOi. 

-.-.-.0 2-20 2110 2140 2150 2160 

CZXCC^.TT!.^. AG;:.y^.TATC A7CTTTGGTG TTTCCTATGA TGAATATAGA TACAGAAGCG 
CGTGGTAArr TCTTTTATAG TAGAAACCAC A^AGG^.TACT «CTT,-.T«TCT «i«.CTTC^-^ 
G-Tj, ^K I IFGVSYD EYR .•'RS> 

Cvs^IC -IBROSIS TRAKSMD-EPA)n- COICD'JCTAIOCE REG'JL.i.TOR; CODON > 

~ V. •jNrT:oT2 £2>.-crT=-I13 i<Z.SS)^QZ n > 

164 0'i' 123 TO 4622 O" HlWAls^ CFTR CD>iA 1680i 1650> 

2170 2180 2150 2200 2210 2220 

^rj.r-. ;iTGCCAACTA C- '-^GP.GO.KC'^ TCTCCAAGTT TGC-.a^GA^-^ a-.C^ATATAG 

AGTACTTT^CG TACGGTTGAT CTTCTCCTGT AG^C-GTrC^A ACGTCTCTTT CTGTTATATC 
vi'ka col Z Z D ISXF AEK DKI> 

CYSTIC FIBROSIS TR;^Js'S>'EQRAJ>E C0NT)-JCT;'J^CE REGUI.-.TOR; COIX)N > 

h HYBRJD E1A-CFTR-II5 MESSAGr. ^_.> 

1700i 123 TO 4 622 Or KUM.«<' CFTR CEW. 174 Oi 1 /:>-•> 
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2230 22«0. . 2250 2260 2270 22B0 

ITCTTOGAGX ACCTCGAATC ACACTGWH^ JSS^SJS ISS^^ 

AACyACCTCT TCCWXHTAG TGTGACTCAC CTCCAGTTGC TCCTTCTTAA ^igAAATCGTT 
VL.GE GGZ TLS CCQR A-R* 5l# A> 

CTSTIC FIBROSIS CCNbOCTANCE RBCSOIATOIU CODOM ^> 

h H YBRID EIA-CFTO-EIB ^SSSyg ^ ——J — ^ 

l?6o V 12 3 TO 4622 OF HUKW CFTR CO^ .1800i_ ^1B?.0> 

2290 2300 2310 2^i20 2330 2340 

GAGCAGTATA CAAAGATCci GATITCTArr TATTAGACTC jaTTTTOGA JJCg^TC 
CTCGICATAT GTITCTACGA CTAAACATAA ATAATCTGAG ACGAAAACCT ATCGATCTAC 
RAVY KDA DLY LLDS PFG y i*J5> 

OrSTIC FI^SXS TRANS»4EWBR»NE CXWDXTANCE REGOLATOR; CODON ^> 

"■ . h • HYBRID EIA-CFTH-EIB MESSAGE . a > 

1 B2Q ? ^ fS 1622 OF HIWAN CFTO C3ia 1860i 1870> 

'*2350 2360 2370 2380 23$0- 2400 

tittaacagI aaaagaaata tttgaaw^ct gtctctctaa actgatggct aacmaacta 

AAAATTGTCT ymCTiT AT AAACTTTCGA CACAGACATO TGACTACCGA 7TGTTTTGAT 
VLTE KEI FES CVC K LMA K JH T> 

CYSTIC FIBROSIS TRftNSKEMBRftNE COMDDCTAMCE RBGOEATOR; CODOJ^ ^> 

js HYBRID EiA-CFTR-ElB MESSAGE ^ » ^> 

IBBPi 123 TO 4622 OF HOMM? CFTR CENft 1920:i ^1930> 



2410 2420 2430 2440 2450 .. 24 60 

GGATTTTOGT CMTPTCTAAA ATGGAACATT TAAAGAAAGC TGACAAAATA TTAATTTTGC 

tIS^ ^rr^G ACTcnrrrAT aattaaa^g 

CYSTIC FIBROSIS TRAKSMSKBRANE CONDOCTANCE REGDIATOR; CODON > 

-h wvaMD SlA-CrTR-ElB . MESSAGE . n > 

1940^ n22^HUMM^ CFTR CENA ^1980i 1990> 

2470 2480 2490 2500 2510 2520 

ATGAAGGTAG C^lGCTATITT TATGGGXCAT rTTCAGAACT CCAAA?.TCTA Ej^EEflE^E: 
TACITCCATC GTCGA.TAAAA ATACCCTCTA AAAGTCTTGA GGpTT«GAT GTCGGTCT^ 
H^GS S ^ vGT FSEL QNL QPD> 

CYSTIC FIBROSIS TK^StEyZK-i^ COlTOaCTANCE REGULATOR; CODON > 

h ' " -•^3?JID El>.-CrT?.-ri3 KZSSAGE h > 

2000i 123 TO 4622 OF HUMAW CFTR CDE'U-. 2040i 2050> 



2330 -2540 21.50 2560 2570 25£C 

TTAGCTCAAA ACTCi.TGGGA TGTGATTCTT TCGA.CCAA.TT TAGTGCAGAA A G^^G^J-J-. Ti 
AA.TCGAGTTT TGAGTACCCT ACACT;-JiGAA AGCTGGTrAJ.. ATCi'.CGTCTT TCTTCrTT^.^. 
-SSK LMG CDS FDOF SAi RRK> 

CYSTIC FT3R0S-S TR?--NSHS>S3.--Kr: CCIHXJCTAI^CE RSGULA.TOR; CODDSv > 

h" ' HYBRID EL^.-CFTR-EIB J-ESSAGE 

2060i 123 TO 4622 OF KUKJ-J^ CFTR CKV. 2100i 2ilC> 

2590 2600 2610 2620 2630 2640 

CA;=TCCTAAC TG-IiGACCTTA CA.CCGTTTCT- C;-.TrAG.AAGG AGATGCTCCT GTCTCCTGGA 
GrrAGGATTG ACTCTGGAA.T GTGGCAA.i^GA GTAATCTTCC TCTACGAGG.A CAGAGG.ACC. 
SI-LT ETL HRF SLEG DAP VSW> 

CYSTIC FIBROSIS TRA-NSMSKB-VJTC CONFDUCTAl'JCE REGUI.ATO?.; CODON > 

' h HYBRID ElA.-CrrR-ElB HIISSAGE r. > 

2120i 123 TO 4 622 O" HU)<Als' CFTR CU'i.-. 2150^ 



1 / uosrj/ now / 



2700 




2770 2780 * ^^^^^^ tCCTTAGTAC 

<=«i 

.2300i .123 TO 462 ^^^^ ^880 




.w,u 2540 

•5020 

2900 2910 2,20 

2890 29UU GACACACTCA GTTAACC^^G 

.=a«^- sssss -rrf 
:OX.i.=^- -^i^s jocc«__^> 

^ HVBRIP err?. C!»l>. 2460i . 

^OCi 1^^ - « _ 3000 






^o.a=.c.; TCicco^jc -^g-i -r^?;;; -=a.=..,c 




3050 5040 3050 

•3020 



c>>— ^ ^sissi sssis^ ^c;i«i 

cTTTa«CTC ..c^cc^A Y ^ ^oi^Tii..: co=:-__ 



2 S^0j 123 TO <622 OF HIWAN CFTR C33^?^— r-2580i ^2S90> 

3070 3080 3090 3100 3110. 3120 

TfAGTGMGk AATTAACCWl GWt(aiCTTAA ACCXCTCSCCT TliTSA^^ ATCGJGAGCJi 

KTrct^xmcr ttaatoctt crrcTCMiTr tcctcacgga amactacia tacctctcct 

ZS£E IHE EDL KECL FDD HES> 

_CySTIC FIBROSIS TRANSMEMBRANE CONDOCTANCE REGOIATORj CODOI^^ L> 

h HyBRID EIA-CFTR*E1B HESSAGE n , 

3130 3140 3150 3160 • 31^ 3180 

TACCAGCAGT GACTACATGG AACACATACC TTCGATATAT ^^S&FI^ AAGAGOT^ 
ATGGTCGTCA CTGATGTACC TTGTCTATGG AAGCTATATA ATGACAGGTG TTCTCGAATT 
IPAV TTW K T Y ' h K Y 1 TVH KSL> 

' g ygPIC FIBROSIS TRANSMEMBRANE CMNDOCTANCE RBGOIATORl CODGN _> 

h HYBRID EIA-CFTR-EIB HESSAGE h > 

\2660 i 12 3 TO 4622 OP HOMAN CFTR CEMA 27p.0a : ^2710> 

3190 3200 3210 3220 3230 3240 

TTTTTGTGCT AATTTGGTGC TTAGTAATTT- TTCTGGCAGA GGTGGCTGCT TCrTTGCTTG 
AAAAACACGA TTAAACXSlCG AATCATEAAA AAGACCGTCT CCACCGACGA AGAAACCAAC 
IF-VL IWC LVI - LA E VAA SLV> 

g ySTIC FIBROSIS TRANSMEMBRANE tC'JDDCTANCE. REGUIATOR; CODON ^> 

h___JffBRID ElA-CFTR-ElB HESSAGE ^ h ^> 

2 72q 1 123 TO 4622 OF HO«M? CFTR CENA 2760a, , 2770> 

3250 3260 3270 3280 3290 3 3 00 

TGCTGTCGCT CCITCGAAAC ACTCCTCTTC *AGJOAAGG GAATAGTACT CATACTAGAA 
ACGACACC6A GGAACCTTTC T6AGGAGAAG .TTCTGTITCC CTTATCATGA GTATCATCTT 
VL WL LGN TPL QDKG NST HSR> 

_CYSTIC FIBROSIS TRANSMEMBRANE CC3NDDCTANCS REGUIATOR; CODCW > 

h H YBRID ElA-CrTR-ElB MESSAGE h > 

2 7BQi 123 TO 4622 O? HUMAN CFTR CDNA ^2820i 2830> 

3310 3320 3330 3340 3350 3360 

ATAACAGCTA TCCAGTGATT ATCACCAGCA CCAGTTCGTA TTATGTGTTT TACATTTACG 
TATTGTCGAT ACGTCACTAA TAGTGGTCGT CGTCAAGCAT AATACACAAA ATGTAAATGC 
WNSY AVI ITS TSSY YVr YIV> 

CYSTIC FIBROSIS TRW\'SHEMSRANE COJroUCTANCE REGULATOR; CODON > 

h " " kY3HID ZIA-CFTR-EIB KESSAGE h > 

2840^ ■>23 TO <622 OF KUJl-JJ CFTR CEgQA. 2B80i 2E50> 



3370 



3580 3590 3400 3410 3420 



TGGG.i.GTAGC CGA.Z;=.CTTTG CTTGCTATGG G^.TTCTTCAG AGGTCTACCA CTGGTGCS.TA 
.VC^C^O'TCG GCTGTGAX'iC GWiCr^XTACC CTAAGAACTC TCCAGA.TGCT GACCACGTAT 
VGVA DTL LAH GFFR GL P LVH> 

CYSTIC rlBROSIS TRANSHZ>i3RWvE CONDUCTANCE REGIJL^TO?. ; CODDK > 

h " ~ HY3klD''Ei;-.-Crr?.-ElS heSSAGE h > 

2900i 123 TO 4522 OF KUK-tJ^l CFTR CD1>IA 2940i 2550> 

3430 3440 5450 3460 3470 34S0 

CTCTAATCAC AGTGTCGAAA ATTTTAC;'.CC ACAAAATGTT AC;=.TTCTGTT CTTCAAGC-.C 
aAGATTAGTG TCACAGCTTT TA-iJ-J^TGTGG TCrTTTAC^JV TGTAA.GACAA G^-JiGTTCGTG 
TLIT VSK ILK HKML HSV LQAj- 
^CYSTIC FIBROSIS TRANSHEJ-JBR^^Js'E CONDJCTANCE REGUU^TOR ; CODD;-." 



yyu ytvuMV rv^i/uayj/iioo/ 
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JJYBRID EaA-CrXR-ElB' MESSAGE 

_ _ _ . >A - 



^2960S 123 TO 4622 OF KOM^N CFTR CEtOA 30001 _^3D10> 

3490 3500* 3510 . 3S20 35.30 3540 

CTATCTCAAC CCTCAACACG TTGWACCAC CTCK3GMTCT TJATAGATTC '^^^fJAOKTA 
GATACWrrrC GGAGTTCTCC MCrnCCTCCACCCTAXGA ATTATCTAAC AQCTTTCTAT 
PMS T LKT LKA CGIL NRF SKD> 

CVSTIC FIBROSIS T1»NSMEMBJ«NE. CONIXXrTAIKX REGOLATOR; CODDN > 

» h H VBRID EIA-Cm-EIB MESSAGE t^^ttV rrrx*' 

a 020 ? 4622 OF HOMAM CFTO COtt 3060S ^3070> 

3550 3560 3570 • 3580 35$0 3600 

TACCAATTTT GGATCACCTT CTCCXTTCTTA CCATATTTGA CTTCATCCAG TltOTATtAA 
ATCGTTAAAA CCTACTCGAA GACGSAGAAT GGTATAAACT GAAGTAGGTC AACAATAATT 
I A IL DDL LPL TI.FD FIQ LLL> 

C YSTIC FIBROSIS TRAMSbEMBRANE CCNCOCTANCE REGOIATOR; CODON > 

h H YBRID EIA-CFTR-EIB MESSAGE ^ -_> 

a OBQi ^123 TO 4622 OF- HUMAN CFTR CCNA_3120^ 313 0> 

3610 3620 3630 3640 3650 3 660 

TTCTCATTCG ACCTATAGCA CTTGTCbCAG "^TAOACC CT CTT^AACAC 
AACACTAACC TCGATATCGT CAACAGCGTC AWATGTTCG 'GATCTAGAAA CAACGTTGTC 
IVIG AIA .V VA VLQP.VIF VAT> 

CYSTIC FIBROSIS TRANSMEMBRANE CCNDOCTANCE REGOIATOR; CODON 9- 

h HYBRID ElA-CFTR-ElB MESSAGE _h .> 

3 14 Oi 1 23 TO 4622 OF HUMAN CFTR CEMA 3180a 3190> 

•3670 3680 3690 3700 3710 ^ 3720 

TGCCAGTGAT AGTGGCTTTT ATTATGTTGA GAGCATATTT On-CCAA AC C TCACAGCAAC 
ACGGTCACTA TCACCGAAAA TAATACAACT CTCGTATAAA GGAGGTTTGG AGTGTCGTTG 
VPVI VAF IML RAYF LOT SOQ> 

^CYSTIC FIBROSIS TRANSMEMBRANE CONDUCTANCE REGULATOR; CODON > 

h H YBRID ElA-CFTR-ElB MESSAGE 

3200i 123 TO 4622 OF HUMAN CFTR CE)MA 32403. 3250> 



3730 3740 3750 3760 3770 3780 

TCAAACAACT G3AATCTGAA GGCAGGAGTC CAATTTTCAC TCATCTTGTT ACAAGCTTA;^. 
AGrrTGTTGA CCTTAGACTT CCGTCCTCAG GTTAAAAGTG AGTAGAACAA TGTTCGAATT 
LKQL ESE GRS PIFT HLV TSL> 

C^'STIC F^EROSIS TRANSKZ>BRAlvE CCKTOCTAKCE P^ULATOR; CODCi-; > 

h HVBRID ElA-CFTR-^lB MESSAGE ^^^> 

2?60'-: --3 TO 4522 6? 'nUmi CFTR CJXV. i^OOi j310> 



3790 3800 3B10 3620 3830 3840 

AAGSACTATG GiCACTTCGT GCCTTCGa^C GSCAGCCTTA CTTT GAAA CT CTGTTCCACi. 
TTCCTGATAC CTGTGAAGCA CGGAAGCCTG CCGTCGGAAT GAAACTTTCA GACAAC-GTGT 
K G L W T L R A F G R Q P V PET L H> 

CYSTIC FIBROSIS TRA2\'S?<EM3RAlvi: COJ^CTAIQCE REGULATOR; COTON > 

h " hybrid EL6.-CrTR-E13 MESSAGE h > 

3320i 123 TO 4622 OF HiS'SAl'J CFTR CI»JA 5360i 33'7 0> 



3850 3860 3870 3880 3890 2900 

AAGCTCTGAA TTTACATACT GCCAACTGGT TCTTGTACCT (.TCA-ACACTG CGCTC-GTTCC 

TTCGAGACTT A.iJ^.TGTATGA CGGTTGACa^- AG.kAC^TC-GA CAGTTGTGAC C-CGA.C3.--AC-G 

K A L H L H T A N W F L V L S T. L R v; T> 
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.CYSTIC FIBROSIS"TRM«SMEJfflRMIB CONDOCTANCE RECOIATORj 
h • H VBRTD EIA-CFTR-EIB MESSAGE rrrr^ 



3380i 123 TO 4622 OF HUMAN CTTR CD«A 34201 3 430> 



3910 3920 .3930 3940 3950 3960 

AAATCAGAAT AGAAATGATT TTTGTCATCT TCTTCArPGC TCT^^TC JJXTOCJTTT 
TTTACTCTTA TCTTTACTAA AAACAGTAGA AGAAGTAACG ACAA3X5BAW5 TAAAa*X'AAA. 
QMRI EMI FVI FFIA VTF ISI> 

CYSTIC FIBROSIS TRANSMEMBRANE CONDOCXANCE BBGOIATORj; CODOl^ > 

h . H VBRID ElA-CFTR-ElB MESSAGE w > 

3440i ^123 TO 4622 OF HUMAN CFTR OXOV 3480a 3490> 

3970 • 3980 -3990 4000 4010 4 0 20 

TAACAACAGG AGAAGGAGAA GGAAGACTTG GTATTATCCT GACITTAGCC ATGAATATCA 
ATTCTTGtCC •ICn CC l'CTT CCTTCTCAAC CATAATAGGA CTGAAATCGG TACTTATAGT 
L T^G EGE CRV GIII* TI.A'M H1» 

CYSTIC FIBROSIS TRANSMEMBRANE CCNE«CTANCE RBGOIATOR; CODCH > 

^ Jj_ HYBRID ElA-CFTR-ElB M ESSA GE b > 

3500i ^123 TO 4622 OF HOMAN CFTR OWy 3540i ; — 3550> 

4030 4040 4050 4060 4070 4080 

TGAGTACATT GCAGTGGGCT GTAAACTCCA GCATAGATGT GGATAGCTTG ' ATGCGATCTG 
ACTCATGTAA CGTCACCCGA CATTT6AGGT CGTATCTACA CCTATCGAAC TACGCTAGAC 
MSTL QWA VNS SIDV PSL MRS> 

CYSTIC FIBROSIS TRANSMEMBRANE CCa«DOCTAMCE REGOIATOR; COpON ^> 

h ^HYBRID ElA-CFTR-ElB MESSAGE b '. > 

3560iJ 123 TO 4622 OF HOMAN CFTR CENA 3600i 3610> 

4090 4100 4110 4120 4130 4140 

TGAGCCGAGT CnTAAGTTC ATTGACATGC CAACAGAAGG TAAACCTACC AAGTCAACCA 
ACTCGGCTCA GAAATTCAAG TAACTGTACG GTTGTCTTCC ATTTGGATGG TTCAGTTGGT 
VSRV FKF IDM PTEG'KPT KST> 

CYSTIC FIBROSIS TRANSMEMBRANE CONDUCTANCE REGULATOR; CODON > 

HYBRID ElA-CFTR-ElB MESSAGE h > 

3620i 123 TO 4622 OF HUMAN CFTR CDMA 3660i 3670> 

4150 4160 4170 4180 4190 4200 

aaccataca;*. gaatggccaa ctctcgaaag ttatgattat tgaga;'.ttca cacgtgaaga. 

TTGGTATGTT CTTACCGGTT GAG;^GC7TTC AATACTAATA ACTCTTAAGT GTGCACTTCT 
K?YK NGO LSK VMII ENS KVK> 

cystic fibrosis TRANSKEM3RAJ^'E CONDUCT^DCE REGULATOR; CODDN > 

h ~ HYBRID El^'.-CFTR-EIB MESSAGE _ h > 

.3680^ 123 TO 4622 O? HUMAN CFTR CIXVk 3720i 3'730> 



4210 •;220 4230 <2<0 4250 4250 

^^GATGACA.T CTGC-CCCTO. GG3GGCCAAA Ta^.CTGTCAA AGATCTCAC^. GCAAAATAO-. 
TTCTACTGTP G;^.CCGGG.^GT CCCCCGGTTT ACTGAC^iGTT TC7AGAGTGT CGTTTTATGT 
KDDI WPS GGQMTVK DLT AKY> 

CYSTIC FIBROSIS TR-^^J^^SMZKBR^-^^ COOT)'JCTANCE RZGUu^TOR; CODDN > 

^^" rrt3?UD £lA-CrTR-Z:i3 !<ZSSAGE h > 

374 Oi 123 TO 4622 Or KJl^J^ CFTR CDMA 37 80i 37S0> 



;270 <280 4290 4300 4310 432C 
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TECC NAl L.£K ISFS IS-P GQR> 

_CySTlC FIBROSIS TRANSMBfflRMffi CONDCJCTAMCE RBGOIATOR; COOQI^ ^ 

h H VBRID EIA-CFTR-EIB M ESSA GE » Z7:=-> 

, 3800i ^123 TO 4622 OF HOMAN CFTR CCM^— 38«0a 38S0> 



4330 4340 4350 O60 ' 4370 .4380 

TGGCCCTCTT GGGAAGAACT GGATCAGGGA ACSAGTACTTT GTTATCAGCT TTTTTGAGAC 
ACCCGGAG^A CCCTTCTTGA CCTACTCCCT TCTCATGAAA CAATACTCCA AAAAACTCTG 
VGLL C RT GSG KSTL LS_A FLR> 
g ygPTe FIBROSIS TRANSMDfflRANE COSDOCTANCE REGULATOR; CODOI?__> 
h • H VBT^ TP ElA-CFTR-ElB MESSAGE ° - 
!>BgQ j 12 3 TO 4622 OF HUMAN CFTR ma. 39QQ1 -XOift^ 

4390 4400 4410 4420 4430 4440 

TACTGAACAC TGAAGGAGAA ATCCAGATCC ATGGTGTGTC TTGGGATTCA ATAACTTTGC 
ATGACTTGTC ACTTCCTCTT TAGCnX:TAGC TACCACACAC AACCCTAAGT- TATTGAAACG 
LLNT E GE IQI D GVS WDS ITL> 
g ygpie FIBROSIS TRANSMEMBRAME CONDOCTANCE REGOZATC^ CXXXHa v 

h m SRID ElA-CFTR-ElBi MESSAGE h ; > 

^3920i ^123 TO 4622 OF HUMAN CFTR CENA 3960i 3970> 

4450 4460 4470 4480 4490 4500 

* • ■» 

AACAGTGGAG GAAAGCCTTT GGAGTGATAC CACAC5AA?VGT ATTTATTTTT TCTGGAACAT 
TTGTCACCTC CrTTCGGAAA CCTCACTATG GTGTCTTTCA TAAATAAAAA AGACCTTGTA 
QQWR KAF GVI PQKV FIT S;GT> 
C YSTIC FIBROSIS TRANSMEMBRANE CONDOCTANCE REGULATOR? CODON___> 

h ■ H YBRID EIA-CFTR-EIB MESSAGE h > 

^39803 123 TO 4622 OF HUMAN CFTR CENA__402Oi_i_4O3O> 



4510 4520 4530 4540 4550 4560 

TTAGAAAWA CTTGGATCCC TATGAACXGT GGAGTGATCA A GAAAT ATGG AAAGTTGCAG 
AATCTTTTTT GAACCTAGGG ATACTTGTCA CCTCACTAGT TCTTTATACC TTTCAACGTC 
FRKN LDP YEQ WSDQ EIW KV A^ 

CYSTIC FIBROSIS TRAKSHSIBRANE CONDUCTANCE P^EGUIATOR; CODON__> 

h ^HYBRID El>.-Cr7R-E13 MESSAGE h > 

40401 123 TO 4622 OF hm=^ CFTR CIKA_4080i 4090> 

4570 4580 4590 4600 4610 4620 

ATGAGGTTGG GCTCAGATCT GTGA.TAG-.-.C AGTTTCCTGG GAAGCTTGAC TTTGTCCTTG 
TACTCCAACC CGAGTCTAGA CACTATCTTG TCA;^AGGACC CTTCGAACTG A-fiu^^CAGCL-^.C 
D E V G L* R S V I E Q F ? G K L D r V L-> 

CYSTIC FIBROSIS TRANSME>3KANE CONDUCTANCE REGL7LAT0R; COEXDN > 

h HYBRID ElA-CrrR-E13 HLSSAGZ h > 

41001 123 TO 4622 Or KUKAN CFTR CDMA 41401 4150> 

4630 4640 4650 4660 4670 . 46£C 

▼ 

TGa^.TGGGGG CTGTGTCCTA AGCO.TC-GCC AC-AGC.i'.GTT G;^.TGTGCTTG GCTAGATCTG 
ACCTACCCCC GACACAGGAT TCGGTACCCC TGTTCGTCAJv CTACACGAAC CG.ATCTAG.AZ 
V D G G C V L S H G H K 0 L M C L A R S> 

CYSTIC FIBROSIS TRAJ\^ShlEhSAAJ'7E CONT>jCTAJ^CE REGULATOR; CODON > 

h HYBRID ElA-CFTR-ElB 1<ZSSAGE h > 

41601 123 TO 4 622 OT HU)-lA)v CFTR CDUA 4 2 001 <2i:> 

4690 4700 4'^10 472C 4730 474! 

':TCTC;'.GTAA GGCGAA.GA.TC TTGCTGLr:*?. krr.,:-.;r.ZU:Z TCrTCATTTC a-.TCC-.CT.--- 
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AAGAGTCATT CCGCTTCTAC AACCACCAAC TACTTCGGTC ACGAGTAAAC CTAGCTCATT 
VLSK AKl L-L1. DEPS AHL DPV> 
- C YSTTr FIBROSIS TRANSMaiBRANE CONDUCTANCE REGULATOR; OOD0M___^> 



h H VBRTP EU-CFTR-EIB MESSAGE ^ 

.4220i ^123 TO 4622 OF HUMAN CFTR CI»WV__4260i.; ^4270> 

4750 4760 4770 4780 4790 4800 



CATACCAAAT AATTAGAAGA ACTCTAAAAC AAGCATTTCC TGATTGCACA GTAATTCTCT 
GTATGGTTTA TTAATCTTCT TGAGATTTTC TTCGTAAACG ACTAAOGTGT CATTAAGAGA 
TYQl I RR T.LK QAFA DCT V1L> 
g ygric FIBROSIS TRANSMEMBRANE CONDOCTANCE RBGUZATOR; CQDai__> 

h H VBRID ElA-CrrR-SlB MESSAGE h y— > 

^ gftftj TO 4622 OF HOMAM CFTR CDMA .4320i ^4330> 

4810 . 4820 4830 4840 4850 4860 

GTGAACACAG GATAGAAGCA ATCCTGGAAT GCCAACAATT TTTQGTCATA GAAGAGAACA 
CACTTGTGTC CTATCTTCGT TACGACCTTA CGGTTGTTAA AAACCAGTAT CTTCTCTTGT 
CEHR lEA MLE CQQF L V I E E N> 
C YSTIC FIBROSIS TRANiSMDfflRANE CONDOCTANCE RB60IAT0R; CODat_-_> 

h H VBRID EIA-GFTR-EIB MESSAGE h I > 

^4340i ^123 TO 4622 OF HUMAN CFTR CENA_4380i ^4390> 

4870 4880 4890 4900 4910 4920 

AAGTGCGGCA GTACGATTCC ATCCAGAAAC TGCTGAACGA GAGGAGCCTC TTCCGGCAAG 
TTCACGCCGT CATGCTAAGG TAGGTCTTTG ACGACTTGCT CTCCTCGGAG AAGGCCGTTC 
KVRQ YDS IQK LLNE RSL.FRQ> 
C YSTTC FIBROSIS TRAMSMEMBRANE CWDOCTANCE RBGOIATOR; CODa(NL___> 

' HYBRID EIA-CFTR-EIB MESSAGE h ^> 

4400i 123 TO 4622 OF HUMAN CFTR CDNA 44401 44 50> 

4930 4940 4950 4960 4970 4980 

CCATCAGCCC CTCCGACAGG GTGAAGCTCT TTCCCCACCG GAACTCAAGC AAGTGCAAGT 
GGTAGTCGGG GAGGCTGTCC CACTTCGAGA AAGGGGTGGC CTTGAGTTCG TTCACGTTCA 
A ISP SDR VKL FPKR NSS KCK> 

CYSTIC FIBROSIS TRANSMEMBRANE CONDUCTANCE REGUIATOR; CODON > 

^h^ HYBRID EIA-CFTR-EIB IGiSSACZ - h > 

4460i 123 TO 4622 OF KUl-IAN CFTR CEMA 4500i_, 4510> 

4990 5000 5010 5020 5030 5040 

CTP-^GCCCCA GA.TTGCTGCT CTGA-rAGAGG AG-.CAGA.AGA AGAGGTGCAA GATACAAGGC 
GATTCGGCGT CTA^CGACGA GACTTTCTCC TCTCTCTTCT TCTCCACGTT CTATGTTCCG 
SK?0 lAA LXE ETEE EVQ DTR> 

CYSTIC FIBROSIS TRA>?SMEK3RANE COTOUCTAJ^CE REGULATOR; CODON > 

HYBRID ElA.-CrTR-E13 MESSAGE h > 

'.520i 123 TO 4 522 OF r2P/J^^ CFTR CI2^ 4 560i <57C> 

5050 5060 5070 5080 5090 5100 

TTTAG^a^GC AGC-.T>^J^.TG TTGACATGGG ACATTTGCTC ATGGAJ^.TTGG AGGTAGCGG.^. 
A^Ji.TCTCTCG TCG7ATTTAC AACTGTACCC TGTAAACGAG TACCTTA.^CC TCCATCGCCT 

> 

h rrr3rLID ElA-CFTE- ElB MESSAGE h > 



4 580i 123 TO 4 622 OT KUht-JC CrTR CDX- 4 520i > 
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5110 5120 5130 5K0 5150 S160 

TTGAGGTACT GXAATCTCTG GGCGTGGCTT AXCCGTGGGA f^^^'^j^l^l IJSSS^S 
AACTCCATGX CnTAOiCAC CCGCACCGWV TICCCACCCT TICTTATATA TTCOCCCCC 

h H ViSRID .EiA-CFTR-ElB MESSAGE __n > 

10__C___P1B 3* UNTRANSLATED SEQOENCES__^50_Srj — —fiO > 

" k 1 0 • > 3' INTRON .K 40 K 50 > 

5170 5180 5190 5200 5210 5220 

TCTCATGTAG TTTTCTATCT GnTOCAGC ACCCCCC^ iTSSS^SSt iSSTUSt 
AGAGTACATC AAAACATAGA CAAAACGTCG TCGGCC3GCGG TACTOCCGCT TGAGOAACT 

MSA M 0 

iX PK0TE3K f HE ^ 

h KVBRXD EIA-CFIR-EIB MESSAGE . -h 

1 1 • T X MRNA^_ 1 _> 

70 o PIB 3' UNTRANSLATED SEQUENCES 110 — g 120._> 

60_1E1B 3 • INTROM ^80 > 

5230 5240 5250 5260 5270 5280 

TGGAAGCATT GTCAGCTCAT ATTTGACAAC GCGCATGCCC gCATCGGCCT GOTTGOTOl 
ACCTTCGTAA CACTCGAGTA TAAACTGTTG CGCGTACGGG GGTACCCGGC CCCACGCAGT 
GSI VSS YLTT RM P PWA G VRQ> 

JX PROTEIN (HEXON-ASSOCIATED PROTEIN) : CODOl?«STAKr=l > 

JKBKID ElA-CFTR-ElB MESSAGE — 1 h > 

'l JX MRNA^ ^3 1 : ^ 



130 a E 1b""3' UNTRANSLATED SEQUENCES ^170_^ j:_180 > 

5290 53 0 0 5310 5320 5330 5 3 4 0 

GAATGTGATG GGCICCAGCA TOATCGTCG CCCCGTCCTG CCCGCAAACT CTACTACCTT 
otISSaC SgAGGtSt AACTACCAGC GGGGCAGGAC GGGCGrTTGA GATGATGGAA 
NVM GSS IDGR PVL PAN STT L-> 

IX PROTEIN (HEXON-ASSCXIATED PROTEIN) ; CODON_STAKr=l > 

• n ^HYBRID EIA-CTTR-EIB MESSAGE Jj ^> 

1 1 IX MRNA 1 1 : ^> 



190 Q S:i3~3' UNTRANSLATED SEQUENCES 230__g ^240 ^> 

5350 5360 5370 5380 5390 5400 

GACGTACGAG ACCGTGTCTG GAACGCCGTT GGAGACTGC^ GCCTCCGCCG CCGCTTCAGC 
CTGGATGCTC TGGCACAGAC CTTGCGGC^A CCTCTGACGT CGGAGGCGGC GGCaXAGTCG 
•pvT- TVS GTPL ETA ASA AAS A> 

PRCTZm (HZXON-P^SOCIATED PROTEIN) ; C0D0N_START=1 > 

h KYsRID ElA-CrrR-E13 MESSAGE h > 

1 1 _IX HRNA 1 .1. > 



250 g E13 3- UlvTRANSLATED SEQUENCES 290 — g 300_> 

5410 5<20 5430 5440 5450 5460 

w 

CGCTGC^GCC ACCGCCCGCG GGATTGTGAC ja^CTTTGCT TTCCTGAGCC CGCTTGC---.G 
GCa?.CGTCC-G • TGGCGC-GCGC CCTrACACTG ACTGAAACGA AAGGACTCGG GCGAACGTTC 
AAA TAR G.IVT D?A FLS PLAS> 

TX PRCTZrN ( KEy.ON- ASSOCIATED PROTEIN) ; C0D0N_START=1^ > 

V, r-r/BRID ElA-CrTR-ElB MESSAGE h > 

1 1 IX K?J>i^ 1 i : > 

310 o E13 3" GI-JTRANSU^TED SEQUENCES 350 — g 360 > 



5470 5<80 5490 5500 5510 552C 

CA.GTGC^.GCT TCCCGTTCi.T CCGCCCGCGA. Ta-.CivAGTTG ACGGCTCTTT TGGCA.-i.-.TT 



—SO*" 

ClCACGTCCy. AGGGCAXGTA GGCCGCCGCT ACTCITCAAC TGCCCAGJAA ACCCTGTTAA^ 
SAA crS sard DKI* TAL L. A«i<> 

__IX PR(D?ElS (SeXON-ASSC^IATED PROTEIN) ; C0D0N_STARr«3 ^> 

h H YBRID ElA-CTTR-ElB MESSAGE n ^> 

1 ^ ^3 JK MW » , rrTT-i 



3 70_ g E 1B 3' UMIRMJSIATED SEQOEMCES 410^ 420_> 

5530 5540 5550 . 5560 5570 5580 

GGArrcrrrG acccgggaac ttaatctcg^ "CTCagcag crcjjgjjc JGC^cagca 

CCTAAGAAAC TCGCCCCTTO AATTACAGCA AAGAGTCGTC GACAACCTAG J«^^GT 
DSL TRE LNVV SQQ LLD liK.QQ> 

^IX PROIEIN (HEXC3N-ASS0CIATED PROTEIN) ; CODQN»STARr=X _!__> 

h H VBRID EIA-CFTR-EIB MESSWSE ; tL * 

1 ■ TV MIOOl ^ ■'• 



\ 4 3 0 i fiw '^' llMTHAMSIATED iSEQOElCES ^C70__g _480 — > 

5590 5600 5610 5620 5630. 

GCnrrCTGCC CTCAAGGCTT CCTCCCCTCC CAATGCGCrrt taaaacataa ataja 
CCAAXGACGG GACTTCCC5AA GGAGCSGGAGG GTrAC(5CCAA ATTirGTATT TATIT 
VSA LKA S'S. PP NA V •> 

DC PROTEIN (HEXON-ASSOCIATDJ PRDIEIM) t C ?- 

h H VBRIP EIA-CFTR-EIB MESSAGE J> > 

1 1 D C 1 1 ^> 



490 g E lB 3 ' UOTRMJSLATED SEQUENCES 530 — S. 
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Nadeotide Sequence Analyas of Ad2-<DRF6/P6K'-CFTR 

LOCOS AD2-ORP6/P 36335 BP DS-VSk 

DEPZNITION - 

ACCSS^XON 

XSVMORDS 

fiomtcfi. - 

FSHTORfiS From T&/^>an 

frag 12915 36335 
frag 35069 35973 



pre-wg > 3S973 < 35069 (C) 



xvs 


35794 


35084 


(C) 


xvs 


35794 


3517S 


(C) 










IVS 


35794 




ICJ 


irvs 


35794 


3S343 


(C> 


IVS 


35794 


3S501 


(C) 


lYS 


35794 


35570 


(C) 


IVS 

frag 
pre-msg 


35794 
35978 
36007 


3S766 
36335 
< 35978 


(C) 
<C) 


cpt 


36234 


36335 




ir^g 
pept 
pept 


^ 12915 
< 28478 
28478 


35054 
28790 
28790 


3 
1 


nRNA 


29331 


< 12915 


(C> 


pre-iBsg 


< 12915 


16352 




pre-wg 


< 12915 


20208 




pre--iz^g 


< 12915 


24682 




pre-iasg 


< 12915 


30462 




pre-SDsg 


< 12915 


35037 





Description 

10676 to 34096 o£ Ad2-E4/0StF6 
33178 to 34082 of Xd2 ceq. 
E4 bRIA. CNueleie Acids Res. 9, 1675-1689 
(1981) J/ IJ. Mol. Biol. 149, 189-221 
(1981) 1 « (Nucleic Acids Hes. 12, 3S03-3S19 
(1984)1, [Unpublished (1984)) tSpXit] 
E4 BUNA, intron D7 (J. Virol. SO, 106-117 
(1984)), (Nucleic Acids Res. 12, 3503-3519 
(1984) J , (Unpublished (1984) J 
E4 bAKA intron D6 (Nucleic Acids Res. 12, 
3503-3519 (1984)] 

B4 »RNA intron D5 (J. Virol. 50, 106-117 
(1984) J 

E4 BiRMA intron D4 
(1984)3 

£4 BfiHA intron D3 
(1984)1 

E4 oRlVi intron D2 
(1984) 1 

E4 idKNA intron Dl (J. Virol. 50, 106-117 
(1984) ] 

E4 nRNA intron D (J. Virol. SO, 106-117 (1984)1 

3SS80 to 3S937 of Ad2 seq 

E4 mRNA (Nucleic Acids Res. 9, 1675-1689 

(1981)1, (J. Mol. Biol. 149, 189-221 

(1981) 1 , (Wicleic Acids Res. 12, 3503-3519 

(1984) 1 , (Unpublished (1984)} [Split] 

inverted terminal repetition; 99.54% (fiiochem. 

Biophys. Res. Coaanun. 87 , 671-678 (1979)1. [jr. 

Mol. Biol. 128, 577-594 (1979) J 

1 to 32815 of Ad2 seq [Split] 

33K protein (virion morphogenesis) 

33K protein (virion morphogenesis): 

codoiL-Startsl 

Biol. Chem. 257, 1347S-13491 



IJ. Virol. 50, 106-117 
(J. Virol. 50, 106-117 

13. Virol- 50. 106-117 



E2b bJWA [J 
(1982)1 (Split) 
major late mRNA LI 
189-221 (1981)1. [J 
[Split! 

major late nRNA L2 



(alt.) (J. Mol. Biol. 149, 
Virol. 48, 127-134 (1983)1 



_ (alt.) (J. M'-l. Biol. 149, 

189-221 (1981)1. [J. Virol. 38, 469-482 
(1981)1. [J. Virol. 48. 127-134 (1983)1 [Split] 
major late mRNA L3 (alt.) (Nucleic Acids Res. 
9 1-17 (1981)1, (J. Mol. Biol. 149. 189-221 
(1981)1. (J. Virol, 48. 127-134 (x983)] [Split] 
major late mRNA L4 (alt.) (J. Mol. Biol. 149, 
189-221 (1981)1, IJ. Virol. 48, 127-134 (1983)1 
[Splic] 

oaior late mRNA (alt.) (J. Mol. Biol- 149. 
189-221 (1981)1. IJ. Virol. 48. 127-134 (1983)) 
(Split) 
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Nndeofide Sequence Analysis (oonL) 



tSeSh, < 12915 



13278 



XV8 


< 


12915 


163&8 


IVS 


< 


1291S 


18754 


xvs 


< 


1291S 


20238 


rvs 


< 


12915 


21040 


IVS 


< 


12915 


23888 


IVS 


< 


12915 


26333 




< 


12915 


13005 


8KA 


< 


12915 


13005 


???? 


< 


12915 


13262 



pept 
pept 



signal 

pept 

pept 

pept 
signal 

pept 

pept 

7777 

signal 

pre-'Bisg 

pre-sieg 

pre-rBisg 



13279 
14547 



16331 

16390 

18112 

18778 
20188 

20240 

21077 

12915 

246S7 

28193 
2819S 
29330 



14526 
16304 



16336 

18105 

18708 

19887 
20193 

20992 

23983 

24631 

24662 

24659 

246S9 

24659 



»a4or late inRMA intron (precedes 52,55K URMA: 
^tL SNar(Cell 16, 841-850 (1979) J. [Cell 

(19?9)1.[J. Mol. Biol^l34, 143.1S8 
fl979)l IJ. Mol. Slol. 135, 413-433 
al?9 : H;ture 292, 420.426_ (1981)3 t^lit] 
na-ior Ute »RMA intron (precedes penton nRNA; 
SrL^flS. Virol. 48, 127-134 (1983)) 

S«*^late jKRKX intron (precedes pV rtRNA; 2nd 
S r^^^Biol. Chea. 259, 13980-13985 

la^^ i»tron (precedes pVl 1st 
S^) IJ. Virol. 38, 469-482 (1981)1 [Split) 
iajor late »RNX intron (precedes hexon 
2nd L3 aiRKX) IProc. »atl. Acad. Sei. O.S.A. 75, 
sS22-58S6(l978)),[Cell 16, 841-850 (1979)) 

iSii^'iate »BNA intron (precedeo 23K ttRNX; 3rd 
S BiRHX) [Nucleic Acids Res. 9, 1-17 (1981)] 

Sit^late iiiR»» intron (precedes lOOK iiKNA; 1st 
S^Sl^riVixology 128,^140-153 (||B3)) [^iitl 
VA I RNA (alt.) tJ. Biol. Cbem. 252, 9043-9046 

4i7ik^1?it!) ta. Biol. Che». 246, 6991-7009 
(1971) J. t J, Biol- Che». 9047-9054 
(1977))* [Proc. Natl. Acad. Sci- U.S.A. 77, 
2424-2428 (1980)1 (Split) ^ „ - « 

VA II UNA tProc. Natl. Acad. Sei. U.S.A. 77, 
3778-3782 (1980) J , (Proc. Natl. Acad. Sci. 
u:s.A. 77. 2424-2428 (1980)1 [Split) 

1 S2.55K protein; codon_etart«sl 

1 Ilia protein (perlpentonal hexon-associated 
protein; splice sites not sequenced} ; 
codon_start=l 

najor late atfttW Ll poly-A signal (putative) 

1 penton protein (virion con^Kwent III) ; 

codon_start=l 
1 Pro-VII protein (precursor to naaor core 

crotein) ; codon_6tart=l ^ .. , 

1 prpSein (minor core protein); codon.8tart=l 

aajor late wRNA 1-2 polyadenyation signal 

(putative) 49.94% 
1 pVI protein (hexon-associated precursor) ; 

codon_start=l ^ 

1 hexon protein (virion component II), 

codoi3L_startssl ^ ^ , 

23K protein (endopeptiaase) ; codon^tart«l 

iSii'^Lte inRNA 1,3- polyadenyation signal 
(putative); €2.38* 

E2a late »BNX (alt.) U- Mol. Biol. 1*9. 

S'Ste^i^A (alt.) .tNucl.ic Acids Res. 12, 
. 3503-3519 (1984)1. (Urvpublished (1964)3 
(C) early nJ^A (alt.) (J- Mol, Biol. X49, 



(C) 

(C) 
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Nudeodde Sequence Analysis (conL) 





29331 


24659 


signal 


24683 


24678 


pept 


26318 


24729 


IVS 


26953 


26328 


p^pt 
ZVS 


26347 
29263 


28764 
27031 


lys 


28124 


27211 


IVS 


28791 
28993 
294S4 


28992 
> 29366 
30137 




29848 
30220 


33103 
30614 


dlgnal 


30444 


30449 


sigxial 


< 12915 


32676 


pept: 


31051 


31530 


pept 
signal 


31707 
32008 


32012 
32013 


IVS 


32822 


33268 


.signal 


33081 


33086 


7777 


< 12915 


35017 


elgnal 


35013 


35018 


pre-rmsg 


35054 


> 35041 


frag 
rpt 


1 
1 
1 


12914 

> 356 

> 103 




< 10 


103 


frag 
frag 


357 
915 


379 
> 923 



189-221 (1981)] 
(C) E2a early (alt.) (J. M61. Biol. 149« 

189-221 (1981) J ^ 
(C) £2a BiRKA polyadenyation eignal on coop steaad 

(putative); 62.43% 
(CI SBP protein {OUi binding or 72K protein) ; 

oodoztJBtartel 
(C) sdRNA intron B tMueleic Acids Res* 9« 

4439-4457 (1981)] 
1 lOOK protein (hexon assezobly) ; codon^starCsl 
(C) E2a early inRMA intron k (Cell 18« 569-580 

(1979)] 

(C) E2a late rtRNA intron A [Virology 128, 140*153 

(1983) ] 

33K-pept intron IJ. Virol. 45, 251-263 (19S3)] 
1 33K protein (virion sic^rphogenesis) 
1 pVIII popotein (hexen-associated precursor) ; 

codoa.8tartsi 

£3*2 85.88% [Gene 22, 157-165 (1983)] 

major late nRK2i intron (*x' leader) [Gene 22 « 
157-165 (1983)], [a. Biol, Chem. 259, 
13980-13985 (1984)] 

iDajor late inPi^ polyadenyation signal; 
(putative) 78.48% 

major late mRNA intron (*y' leader) [J. Mol. 

Biol. 135, 413-433 (1979)], [J. Virol. 38, 

469-482 {1981)1, (EMBO J. 1, 249^54 

(1982)1, (Gene 22, 157-165 (1983)] [Split] 
1 E3 19K protein (glycosylated membrane protein) ; 

codon-.start»l 
1 E3 11. 6K protein; codon^startal 

£3-1 niRKA polyadenylation signal (putative) ; 

82«69% 

major late mRKA intron ('z' leader) [Proc. 
Natl. Acad. Sci. U.S.A. 75, 5822-5826 

(1978)1, [Cell 16. 841-850 (1979) ], [EMBO J- 1, 
249-254 (1982)1, [Gene 22, 157-165 (1983)] 
£3-2 mKKA polyadenyation signal; 85.82% 

(putative) 

fiber protein (virion coi^ponent TV) ; 
codorustartsl (Split] 

major late mRNA LS polyadenyation signal.; 
(putative) 91.19% 
(C) £4 mRMA (Nucleic Acids Res. 9, 1675-1689 
(1981)1, (J- Mol. Biol. 149, 189-221 
(1981)1, [Nucleic Acids Res. 12, 3503-3519 

(1984) 1, (Unpublished (1984)1 [Split] 
1 to 12914 of pAa2/PGK-CPTR 

1 to 357 Ad2 

inverted terminal .repetition? 0.28% [Biochem. 
Biophye. Res. Comraun. 87, 671-678 (1979)1. [J. 
Mol. Biol. 128, 577-594 (1979)) 
inverted terminal repetition; 0.28% [Biochea. 
Biophye- Res* Coanmin. 87, 671-678 (1979)], [J. 
Mol. Biol- 128, 577-594 (1979)1 (Split) 
linker segment 

polylinker cloning sites [Split] 
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Nttdeodde Seqnoice Analysis (cont) 





< 


924 


> 


954 




< 


S567 


> 


12914 


signal 




380 




914 


frag 


< 


955 


> 


958 


< 


5501 




5522 


signal 




5523 




5555 
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polylinker cloning sites [gplit] 
3328 to 1068S of Ad2 [Split] 
pgX promoter , . % 

polylinker cloning sites [Split] 
polylinker oloniitf sites [Split] 
syn. BGH poly A 
linker [Split] 

linker [Split] 

920 to 5461 of PCMV-CPTR-936C 

jnistake in published sequence of Riordan et 

al. C not A is correct = H to H a-a. change 

936 T to C mutation to inactivate cryptic 

bacterial proiooter. Silent a mino a cid cOiange 

polylinker segeaent frcan pCMV-CPTR-936C 

(Rc/CMV-Invitrogen Spel-BstXl) [Split] 

linker segment from pCM7-cn)R-936C. Originally 

SalZ/BstXZ adaptor oligo 1 499D S 

linker segement frott pCMV-CPl!R-936C. 

Originally from pMT-CFTR constrMCtion oligo 

1247 FO •Sal I to Aval sites. 

123 to 4622 of HlttWFlR 

cystic fibrosis transmeabrane conductAnce 

regulator 7 codon^tartcl 

9786 O 7952 T 0 OTHER 



ORIGIN 



'S2-ORP6/P Denoth- 36335 Sep 16, 1993 - 08sl3 H€ Check: 1664 
^ ^llS:£Sa^^ GAACCCAXTA TCSATAATGAO GGGCTTG^ 

121 GATCTOtXrAA GTOTGGOGGA ACACATGTAA GCGC^Jg ^JJ^^ 

181 GTOTCCGCCG GTOTATACGG GAACTGACAA SSSSSS SSJ^^ 

241 TAAATWWOG CGTAACCAAG TAATGTITCG CCATmCGC ^^^^^^ ^^SSSS 

d 

1 sss SSS^ 

til SSSSS cSoOGAGCA ATGGCAOCCSC GCCGACCGCO ATOGGCTCSTC 

^SS^S aSgCGCOCC GAGAOCAfiOO CCCOOOWiOCS OOCGGTCCGG 

Sill TcsnkiiaccTri ctstSAOCGCAC GTOOGCAGTC GGCTCCCTCG TTGACaSAAJT CACCGACCTC 
111 SaaScGT ACGCCTW5TA TTTAAATCGT ACGCCTAGTA 

ll\ ISSScCA ctct^tS GATATCAAAG TCGACOOTAC COGAGAGACC ATCCAGAOGT 

1081 5Saqgaaagg atacagacag cgcctcgaat tgtcagacat ataccaaatc ccttctotto 

ISSSJJS? ->\AAATlt3G AAAGA3AAT0 CGATAGAGAO CTOOCTTCAA 

\ltl ik^SaS?C GGCOAItSlTT -mCTOGAGA TTTATGTTCT 

1201 AGAAAAATCC TMACTOOT ^^CZj^rX—Ji eCAAA(5CAGT ACAGCCTCTC OTACTOGGAA 
1261 ATGGAATC^ ^AJi^TA GGGGJJJTCA ^J^^ JtCTATCGCG ATTTATCTAC 

Hs ^^^E ssss 

E Jf^S s=s 

IS6I TTTCCAACAA CCTCAACAAA J^J^JX^ tAATCTOGOA GTTCTTACAG GCGTCTOCCT 

ilJi ^SSSJ SSS^SS ?l= ISSSS iSSSSS 



rvi/usyj/iloo/ 
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Nn^eotide Se^oaice Analysis (conL) 

1861 lX3MraGAA^^ ACAGMkCIGX AACTCACTOG OAA GCCftQCC TATOlGAGKr 

1921 ACnOATAG CIC2M3CCTIC TTCTTCTCA6 OGTTCTTTGT GOTGTTTTTA TCT6IGCTTC 
1981 CCTATGCACT AATCAAAlWA ATCATCCTCC OOAAAATAXT CACCAOCAOC ^^TTCIOCA 
2041 ITOTTCTOCG CA1G000GTC ACTOGOCAAT TTC C CTOOQC TOTACAAACA f^XATQACT 
2101 CTCTTGGAGC AATAAACAAA ATACAGGATT TCTTACAAAA CCAACAATAT AACACATTOO 
2161. AATAOAACTr AACOACTACA OAACTAOTOA OXWACAATCT AACMXXSTTC TCOGftCCAOG 
2221 CATTTGOOGft ATTATTTfiAS AAAGCAAAAC AAAACAA1AA CAATA QAAAA ACTICTAA!DS 
2281 OTOATQACAO CCTCTTCTTC AOTAATTTCT CACTXCTXOG TACTOCXGTC CXOAAAGATA 
2341 TTAATTTCAA GATACSAAAGA CQACAOTXGT T060GGT1OC TGGATCCACT OGftOCAOGCA 
2401 AGAClOCACT TCTAATGATG ATTATOOOM AACTCGAOCC TTCAGAGGGT AAAATTAAGC 
2461 ACAGIOGAAG AATTICATIC ICTTCTPAOT TTrCCTGOAT TAT60CTG0C ACCATTAAAG 
2S21 AAAATATCAT CTTTOGTOOT TOCTATOAXO AATATAOATA CAGAAGOGTC ATCAA AGCAT 
2581 aCCAACrTAGA ASASGACATC TCCAAGTTT6 CAGAGAAAGA CAATMCftGTT CSTOGASAAG 
2641 GTOGAATCAC AClGAfiiWGA GOTCAACGAO CAAGAATTTC TCTAOCAAQA GCAGTATACA 
2701 AAGATCCTOA rTTCTATTOA TTAOACTCTC CTTTTGGATA CCTaGATOTT CTAACA GAAA 
2761 AAGAAATATT TCAAAGCTOT OICTOTAAAC TOATOGCTAA CAAAACTAGG ATTTTOGTCA 
2821 CTTC1!AAAAT GGAACATTXA AASAAAGCTO ACAAAATATT AAaJTTXOCMf GAAOGTAGCA 
2881 GCTATTTTTA TOOGACATTT TCASAACTCC AAJATCT»CA OOCASACTXT A0CTCAAAAC 
i94l -TCATOGGATC TGATTCTTTC OACCAATTTA GTGCAaAAAG AAGAAATTCA ATOCTAACTS 
3001 AGACCTIACA COOmCXCA TTAGAAGGAG ATCCTCCTOT CTCCTGOACA GAAACAXAAA 
3061 AACAAUGTTT TAAACAGACT OGAOAGTTKS OGGAAAAAAG GAAGAATTCT ATTCTCAAIC 
3121 CAA1CAACTC TATAOOAAAA TmCCATTG TCCAAAAGAC TCCCTTACAA ATGAATOGCA 
3181 TCGAASAGGA TTCIGAtGAG CCTTTAGASA GAAGGCTSIC CTTAGTACCA GATTCTGAGC 
3241 AGOGASASec CATACTOCXST CGCATCAG06 TGATCAQCAC TO6CCCCAC6 CTTCAOOCAC 
3301 GAAGGAOGC21 GTCTOTCCTO AACCTOATGA CACACTCAGT TAACXrAAOOT CAGMjCATTC 
3361 AGOGAAAGAC AACAGCA3CC ACACGAAAAO TGTCACIXSOC CCCTCAGOCA AACTTOACTG 
3421 AACIOQATAr ATATICAAGA AOGTTATCTC AAGAAACOGG CTTGGAAATA AGTOAAGAAA 
3481 TTAAOGAASA AGACTXAAAG GAGTCCCTTT TT6ATGATAT GGAGAGCATA CCAGCAG TOA 
3541 CTACATGGAA CACATACCTT OOATATATTA CTOICCACAA CAGC TOAAOT TCTOTOCTAA 
3601 TTTGGTGCOT AGTAACTTT T CTOGCAQAGG TGGCTGCTTC TTTGen»XG CTOTOGCTCC 
3661 TIGGAAACAC ICCTCTTCAA GACAAAGGGA ATAGTACTCA TAGTAGAAAT AACAGCTATG 
3721 •CAGTCATTAT CACCAGCAO: AOTTOGTATT ATGrTGTTTTA CATTTACGTC GGAGTAGCCG 
3781 ACACTTXGCT TGCTATGGOA TTCTTCAGAG GTCTACCACT GGTOCATACT CTAATCACAG 
3841 TCOOSAAAAT TTTACACCAC AAAATGTTAC ArTCTOTTCT TCAAOCACCT ATGT CAACC C 
3901 TCAACACGTT GAAAGCAGGT GGGATTCTTA ATAGATTCTC CAAAGATATA GCAATTTTGG 
3961 ATGACCTTCT GCCTCTTACC ATArTTGACT TCATCCAGTT CTTATTAATT GTGATTGGAG 
4021 CTATA5CAGT aXSWSOCAOTT TTACAACCCT ACATCTTTOT TGCAACAGTG CCAGTCATAG 
4081 TCGCTXTTAT TATOTTOAGA GCATATTTCC TCCAAACCTC ACAGCAACTC AAACAACTCO 
4141 AATCTOAAGG CAGGAGTCCA AmTCACTC ATCTTGTTAC AAGCTTAAAA OGACTATOGA 
4201 CACnCGTCC COTCGGACCG CAGCCTTACT TTCAAACTCT GCTCCACAAA CCTCTOAATT 
4261 TACATACTCC CAACTOGTK: TTGTACCTOT CAACACTGCG CTCGITCCAA ATGAGAATAG 
4321 AAATOATTTT TCTCATCTTC TICATTGCTG TTACCTTCAT TTCCAnTTTTA ACAACAGGAG 
4381 AAGQAGAAGG AAGAGTTOGT ATTATCCTCA COTTAGCCAT GAATATCATG AGTACATOGC 
4441 ASTCQGCrGT AAACTCCAGC ATAOATOTOS ATAGCTTGAT GCGATCTGTC AGCCGAGTCT 
' 4501 TTAAGTrcAT TCACATCCCA ACAGAAGOTA AACCTACCAA CTCAACCAAA CCATACAACA 
4561 A1GGCCAACT CTCGAAAGTT ATGATTATTG ACAATTCACA CGTOAAGAAA OATCACATCT 
4621 GOCCCTCAGG GGOCCAAATG ACTGTCAAAG ATCTCACAGC AAAATACACA GAAGGTGGAA 
4681 A1GCCATATT AGAGAACATT aCCTTCTCAA TAAGTCCTGG CCAGAGGGTC GGCCTCTT^ 
4741 GAACAACTOG AaX^OOGAAG AGTACTTTOT TAICAGCTTT TTOGAGACTA CTOAACACTG 
4801 AAGGAGAAAT CCAGATCGAT OGTGTGTCTT GOGATTCAAT AACTTTGCAA CAGTGGAGGA 
4861 AAGCCTPTGG AGTOATACCA CAGAAAGTAT TTAmTTTC TGOAACATTT AGAAAAAACT 
4921 TCGATCCCTA TCAACAGTCG AGTGATCAAC AAATATCGAA AGTTGCAGAT GAGOrrGGGC 
4981 TCAGAIXTTCT GATAGAACAG TTTCCOGGGA AGCTTGACTT TGTCCTTCTG GMX3GGGGCT 
5041 OTOTCCtAAC CCATCGCCAC AAGCAGTIGA TGTGCTTCGC TAGATCTGTT JJCAGTAAGG 
SiOl CGAAGATCTT GCTCCTIGAT GAACCCAGTG CTCAirrGGA TCCAGTAACA TACCAAATAA 
5161 TTAGAAGAAC TCTAAAACAA GCATTTCSCTC ATTGCACAGT AATTCTCTGT GAACACAGGA 
sill SSSSaaJ GCTGCAATCC CAACAATTTT TOCnCATAOA AGACAACAAA GTGCGOCAGT 
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Nttdeotide Sequence Aasiysis (coat) 



5281 ACOATTCCAT CCAGAAAC9X3 CTGAACaAGA OGACCCTCTT COGOCAAQOe A3CAOCOCCT 
5341 CCGACW3GGT CAACCTCTTT CCOCACOOGA ACICAAGCAA CTC5CAAOTCT AAGCCCXAOA 
5401 TTCCTGCICT fiAAAOAGOAa ACAOAACSAAG ACOTCCAAfiA •PACAAGOCTT ^^AGAGCAG 
5461 CATA2UITOTT GACAIQaOAC ATTTOCTCAT OOAATOGGAfi AAAanGXAOG «TAeGAOGC 
5521 GTAATAAAAT GAfiGAAATXG CATOGCATTC TCTSAOGCGT TAOGOCGGAA OCTCCTttAfiC 
5581 TACGAXOAeA CCOGCACCAG GTSCASACCC •PCOGAGTCTO GOGg TAXX CA TATTACGAAC 
$641 CAQ C CTOTCA TOCIGaMTGT OACCCAGGAfi CTCSAGGCCOe ATCACTTCOT QCTSGOCTOC 
5701 ACCCGCGCTO AOTTiaGCTC TAGOOATOAA GATACA<3ATT GA GQTAC TGA AATO T OTGSS 
5761 aSTCGCTTAA GGGIOGGAAA GAATATATAA OGTOGOSGrrC TCAIOBAOrT TTCTAacaSCT 
5821 mCCAeCAC CCGOCGCCAT GAGOCCCAAC TOCXTTCATC GAAGC ATTCT CiAOCTCATAT 
5881 TTGACAACGC GCATCCCCCC ATOOOCOOOO GTCCGXCAOA ATGIGATO09 CTCCAOCATT 
S»41 GATOOTOOCC CCGTCCIGCC CGCAAACTCT ACOSICCTTGA CCTACGAOAC CGTOTCTOC3A 
6001 A06GC0TTG0 AGACTeCAlOC CTOOGOOOCC GCnCAGCOS CTGCAGCCAC OGOOOGOQOe 
6061 ATIGTGAC3X3 ACTTIGCTTT CCTfiAOCCCO CTTGCAAjSCA GTOCAGCl<ax: OOGTTCAKC 
•6121. GGCCCSCGATG ACAAGTTOAC GOCTCTTTTO GCACAATTGG ATTCTTIGAC COOOGAACTT 
6181 AATOTGOTTT CTCAGCAGCT GTTOGATCTO CGCCAGCAGG TTTC TgCCC T GAAGQC TOCC 
6241 ICC C CTiXC A ATOOOOrm. AAXCASMAT AAAAACCAGA CTCTCTTIOG ATTTIGATCA 
6301 AiSCAAGTGTC TTGCTOTCTT TAlTTAGGOCr TITTfiOQOGC GOOGTAOOCC COOQACCACC 
6361 GGTCTOGGTC GTTOAGOGIC CTOTOTATTT •raOCCAGGAC ClOOTAAAGCS TCSACTCTOGA 
6421 lOTTCAfiATA GATOOGCATA AGCOSGTCTC TOaOGTGGAG GTAlSCACCAC TC3CAGAGCTT 
6481 CATOCTOCJOO GGTOGTOTT g TAGAJGATCC AGTCGTACCA OOAGOCCTOG GOCSTOGTGCC 
6S41 TAAAAAaerc TTTCAOIAISC AAOCTOATTO CCAOOSeCAO CCCCTTOGIXS TAAGTOTTTA 
6601 CAAAGCGOTT AAfiCTCOGAT GGOTOCATAC OTOOGCAaAT OMKSOOaC mWOACTOTA 
6661 TTTTTAGGl-r GGCTATOTTC CCAGCCATAT CCCTCOOOOO AWCATCTTO TOCAGAAOCA 
6721 CCAOCACACT OTATCCC5G1G CACTTGOCaA ATTTSTCMK3 TAOCTOAGAA eGAAATGOGT 
6781 GGAA8AACTT GGAGAOGCCC TTGTI»CCTC OGACATTTTC CATCCATTCG TCCATAATCA 
6841 TOSCAATOGC OCCAOOOGCG COOSCCTCWO OBA AGATAT T TCPSOGAgCA CTAAOGTCAT 
6901 AGTTOTCTTC CAQQATGAGA TCGTCATAfiG CCATTTTTAC AAAGCGOQG6 CGGACCGTSC 
6961 CAGACTOCGG TATAATGGTT CCATCCQGCC CAGOaOCOTA OTTACCCTCA CASATOTCCA 
7021 TTTCCCACGC TTKSAGTTCA GATOGOGGGA TCATGTCEAC CTGCG GGGCG ATOAAGAAAA 
7081 CCXSOTTCOOG GGTAGGCOAG ATCAGCTOGG AAGAAAGCAG GTTCCTCAGC AGCTGCXSACT 
7141 TACCGCAOCC CSGfTOGGCCCG TAAATCACAC CTATTACCGO CTOCAACTOC TAGTOAAGAC 
7201 AGCK3CAGCT GCCOTCATCC CTGAGCAGGG OGGCCACTTC GTOAAGCATG TOCC TCACTT 
7261 GCATCOTTTC CCTCACCAAA TCC6CCAGAA OOCOCTCGCC COCCAQCGAT ACCAGOT CTT 
7321 GCAAGGAAGC AAAOTTOTTC AACGGTTTGA GGCCGICOGC CGTAOGCATG CX1TTGAGCG 
7381 TTTGACCAAG CAiGTTCCAaa CGGTCCCACA GCTCGGTCAC GTGCTCTACG GCATCTCGAT 
7441 CCAGCATATC TCCK50TTTC GCGOGTTGGG GCGGCTTTCG CTOTACGGCA GTACTCGGTG 
7501 CTCGTCCAGA CGGOCCAGGG TCATCTCTTr CCACGGGCCC AGOGTCCT06 T CAGC GTAOT 
7561 CTGGGTCACG GTOAAGOGGT GCGCTCCGOG CTGCGCGCTG CCCACGOTeC OCnCACGCT 
7621 GGTCCTGC1T3 GTOCTOAAGC GCTGCCGOTC TTCGCCCTGC GCGTCGGCCA GGTAGCATTT 
7681 GACCATCGTO TCATAOTOCA GCCCCTCCGC GGOGTGGCCC TTGGCGCGCA GCTTGCCCTT 
7741 GGAOGSVQOCG CCGCACXAOG OGCAG1X3CAG ACTTOTAAGG GCGTAGAOCT TCGGCGCGAG 
7801 AAATACCGAT TCCGGGGAGT AGGCATCCGC GCOGCAGGCC CC6CAGAO0O OCTC GCATTC 
7861 CACGAOCCAG GTOAGCTCTO GCCG3TCG0G GTCAAAAACC AGOTTOCCCC CATGCTTTTT 
7921 GATGCGinC TTACCTOIGG TTTC3CATGAG CCGGTGTCCA CGCTCOOT6A CGAAAAGGCT 
7981 GrcC3GTX3TCC CCGTATACAG ACnGAGAGG CCTGTCCTCXJ AQCGGTGTTC CGCGGTCCTC 
8041 CTCGTATAGA AACTCGGACC ACTCTGAGAC GAAGGCTCGC GTCCAOGCCA GCACGAAGGA 
8101 GGCTAAGTCG GAGGGGTAGC GGTCGTTGTC CACTAOCGOO T CCACT OGCT CCAGOGTGTG 
8161 AACACACATG TCGCCCTCTT COOCATCAAG GAASQTCSATT OGTrTATAGO TOTAGOCCAC 
8221 GTGACCGGGT GTTCCTCAAG GOOOGCTATA AAAGGGGGTG OGGGCGOGTT CGTCCTCACT 
8281 CTCTTCCX3CA TCGOKSTCTG CGAGOGCCAG CTGTTGGGGT GAGTACTCCC TCTCAAAAGC 
8341 GGGCATCACT TCTOCGCTAA GATTCTCAGT TTCCAAAAAC GAGGAOGATT TGATATTCAC 
8401 CTCGCCCGCG GTCATCCCTT TOAOGGTGGC CGCGTCCATC TOGTCACAAA AOACAATCTT 

8461 TrronxsTCA agcttggtgg caaacgaccc gtagagggcg ttcgacagca acttggcgat 

8521 GGAGCGCAGG GTrrGGTrTT TCTCGCGATC GGCGCGCTCC TTGGCCGOGA TCTTTAGCTG 
8581 CACGTATTCG CGCGCAACGC ACCGCCATTC GGGAAAGACG GTGGTeCGCT CGTCGGGCAC 
8641 CAGGTCCACG CGCCAACCCSC GGTTGTOCAG GGTGACAAGG TCAACGCTGG TC3GCTACCTC 
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Nudeotide Sequence Analysis (cont) 

8701 TCCCOGTAGO CXXIVXPl'XOS TCCaGGJW3A3 GOCXSCCGCCC TTOOGCGAXC AGAMGGOGG 
8761 TAfiTGGGTCT A6CT6CGIGT CGTCCCSGCWG GSPCTCOOICC AOCSSTAAWSA CCCOQCSOCAC 
8821 CAOOCGOGOG lOQMVOXaOT CTATCTTOCA TCCTTOCAAO TCTAOOGCCT GCTGCCKIGC 
8881 OOGGGOOGCai AOCGOGOOCT OOaaaOOOTT CMSMOOGOA CCOCATCOC& 1I«»CCTOOeT 
8941 GAGOGGGGAa GOa33U»!XGC OOCMUmSTC OTMAOGTAO A0QQOCTCIC TSftOT ATTCC 
9001 AAOATAIGTA GaGXAGCIlLTC OTOCACOGOG OATCCTOGOG CGCAOGTAAT CGTATACTTC 
9061 G1GC62USGGA GGGAOGAfiGT OOOGACCGAS GITCCaaOGG GCXWGCOXSCP CTOCTOQQAA 
9121 OACTAieiGC CTOAMSAXCa CATCSTGAGTT GGATGATATC CITOGACOCT GGAAGAOGTT 
9181 OaAGClGGCG TCTOlt»GAC CTACOGCGIC AOGCAOGAAG GAGG0GTA06 A glOGOG CAG 
9241 CTTGWGACC AGCTCOGCOG TGACCTGCAC GTCTAOOGOG CACTA CIOC A OQGTTTCCTT 
9301 OATGATCTCA TACTTATCCT CTOOCTTTTT TTTCCACAOC •TOGCGGTIGA GCACAAAGTC 
9361 TTCGCGOTCT TXCCaOTACT CTOXSaATCGG AAACCCGTOO GC CTCOGA AC OGTAAOAOCC 
9421 TAOCATOEAS AACTOGTTOA CGGCCTOGTA GOOGCAGCAT CCCTTOTCTA OGGOTAGOGC 
9481 GTATGCCTOC GCGGCCTTCC OGAGOGAGOT GTOGGTSAGC CCAAAOGTGT CCCTAAOCAT 
9S41 GACTTTCAGG TAC1«GTAJPT TCAAeTCAGT GIOGTOGCSlT CCX5CCCTGCT CCCAGAOCAA 
9601 AAAGTCCGTXS CGCTTTTTOG AACOOOOGTT TGGCAGGGCG AAGGTGACAT CGTTGAAAAC 
9661 TATCITXCCC GOGOGAGGCA TAAACTTOOG TCTOATSOGG AAQGGTOCCO ©CACCTOOGA 
9721 ACOQTaXSPTA ATTACCTaGO CGGCGAOCAC GATCTCCTCG AAGCCGTTOA 4Wl'a\ JT C;GCC 
9781 CAOGATCTAA AGCTCCAA6A A6CGCG6GGT GCCCTTGATG CAGOGCAAaT TCTTAASTTC 
9841 CTOGTAfiCSTO AGCTOCTCAO GGGAGCTCAG CCCGTCTTCT GACAOGOCCC ACTCTOCAAG 
99.01 ATOAGGGTTC GAAGOGAOGA ATCAGCTCCA CAGGTCAOGG GCCATTAGCA •mOCAOOTG 
9961 OTCOC6AAAC GTCCTAAACT GGCGACCTAT OGCCArTTTT TCTOCGGTGA TCCAGTAGAA 
10021 GOTAAGCSSG6 TCTTOTiTOCC AOCOOTCCCA TCCAAGGTCC ACGGCTAGOT CTCGOGC3C3GC 
10081 eCTCXCCAOA OGCTCA1C1C CGOOGAACW CArPAACCAfiC ATSAACOOCA CeAOCTOCTT 
10141 CCCAAAGCCC CCCATCCAA6 TAaaSOTCTC TACaaCGTAG GX6ACAAAGA OAeOCTCfiCT 
10201 GCGAOC5ATOC GAGCOGATOG GGAAGAACTG GATCTCCCGC CACCAGTTOO AO GAQTOGCT 
10261 GTJQATCWSO TQAAAOTAGA AOTCCCTGCG ACGOOCCGAA CACTOCTGCT OGCTTTTGTA 
10321 AAAACGTOOG CAOn^CTOOC AGCGGTGCAC GGGCTOTACA TCCTGCACGA GGrTC^CTC 
10381 ACGACCOCGC ACAAGGAAGC AGAGTOGGAA TTTOAGCCCC TCG^TCG^ GCTTTGQqTG 
10441 CTGOTCTOCT ACTICGOCTS tfi-fGTCC CTG ACCOTCTOGC !rG«P0OAOOG GAGT^OGT 
10501 GGATCGGACC ACCACGCCGC GOGAGCCCAA AOTCCACATG TCOOCOOGOO OOOGTOGGAG 
10561 CTTSAIGACA ACATCGCGCA GATCGGAGCT GTCCATOCTC T06AGCTTOC GOOGOOACAG 
10621 fiTCAfiOCGOG AGCTCCTCCA GGTITACCTC GCATAGCCX3G GTCAGGGC6C GGGCTAGSTC 
10681 CAGGTOATAC CTOATTTCCA GaGGCTGGTX GGXGGCOGCO TOGATGACTT GCAAGAGGCC 
10741 GCATCCCCGC OGCGC!GACTA COOTACCOOG OGGCGGGCGG TGQGCCGCOG GGGTGTCCTT 
10801 GGATCATCCA TCTAAAAGCG GTGACGCGGG CGGGCCCCOO GAGGTAGGGG GGGCTCGGGA 
10861 CCCGCCOGGA GAGGGGGCAG CGGCACGTCG CCOCCGOGOG COOGCAGGAG CIGGTGCTCC 
10921 GGOOOGAGGT TCCTOGCGAA CGCGAOGACG CGGCGGTTGA TCTCCTCAAT CTGGCGCCTC 
i0981 IGCGTCAAGA CGACGGGCCC GGTCAGCTTG AACCTGAAAG AGAGaTOGAC AGAATCAATT 
11041 "itGGTCTCGT TCACOGCGGC CTQGCGCAAA ATCTCCTCCA CGTCTCCTGA GTTGTCTTGA 
11101 TAGGCGATTT CGGCCATCAA CTGCTCGATC TCTTCCTCCT GGACATCTCC GOOTCCOGCT 
11161 COCTCCACGG TOGCGGCCAG CTOGTTCGAG ATGCGGGCCA TGACCTQCCA CAAGGOGrTG 
il221 SStccS StTCcSaC CCGGCTGTAG ACCACGCCCC cnCOGCATC GCGGGCOCGC 

• 11281 aSaSacot gcgcoaStt gagctccacg tcccgggcga agacgcogta gtttogcagg 

"3" SSSSScG Ga^OGlTCTG CCACGAAGAA ^^^C 

11401 CAOCGTCGCA ACGTCGATTC GTTGATATCC CCCAAGGCCT CAAGGCGCTC CATCOCCTCG 
TAGAAGTCCA CGGCGAAGTT GAAAAACTCG GAGTTGCGCG CCGACACOGT TAACTCCTCC 
lltll JS^JSS S^J^ GGCGACAGOG ICGCGCACCT OGCGCTCJJ^A GGCTACAQGG 
11581 GCCICnCTO CTTCAATCTC CTCTICCATA AfiGGCCTCCC CTTCTTCTTC TICrrCTGGC 
11641 GGCGGTOGGG 6AGGGGGGAC ACOCCOGOGA CGAOGGCGCA CXX3GGAG0CG GTCGACAAAG 
11701 CGCTCGATCA TCTCCCCGCG GCGAOGGCGC ATQGTCTCGG TGACOGCGCG GCCGTTCTOG 
11761 CGGGGGCGCA GTK3GAAGAC GCCX3CCCGTC ATGTCCOQGT TATOGOTTCG CGGGGGGCTG 
11821 CCGTCCGGCA GGGATACOGC GCTAACGATG CATCTCAACA ATTCTTGTST AOGTACTCCG 
11881 CCACCGACGG ACCTOAOCGA OTCOGCATCG ACCGGATCGG AAAACCTCTC GAGAAAGGCG 
mix JSSSSgT ScAGTCGCA AGCTAOGCTC ACCACCGTGG CGGGCGC3CAG CGGGTGGCGG 
^o A S?TGGCGGA GGTXSCTCCTC ATGATGTA^ JJi^SS 

12061 CGGCCGATCG TCGACAGAAG CACCATGTCC TTGGGTCCOC CCTGCTGAAT GCGCAGGCCG 
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Nudeofiideileqaenoe Analysis (cont) 

1^121 TCGCOCAtCC CCCAGGCTTC OTITTGACAT COeCGCAOGT CTTTOTAOTA GTCTTGCATC 
i?161 JtfSCCTTTePA OOa(3CACTIC TXCTTCTCCT TCCTCTIOTC CSKXaTClCT TOCMCTATC 
12241 GCTACiOGGQG OaaOGCSAGIT TSSCOeTACO TOSOCCCCTC TK5CTCCCAT GOCTGXGACC 
12361 CCGAAGOCCC TCATCOOCTO AAGCAOGGCC AeeTCOGCseA CSJiaSOGCTC GGCTAATATe 
12361 GCCTGCTOCA CCTOCOIGAG GGTAGACTGS AAfiTCATOCA TOTCCACAAA OOOQfOOTAT 
12421- GOGOCCGIGT TGMW3TOTA AGIQCAOTTS CCCATAAOGO ACCAGTWOC OGTCTOGTeX 
12481 fcCOGGeiQOQ AGMCTCGGT GTACCTGAfiA OCSOGSAGTAAfi CCCTTOASTC AAA«»qGTAG 
12541 TCGTIGCAftS TCOGCACOtfJ GXACTGATAT CCCACCAAAA AfiTGCOGCGG OCGCTOOCGG 
12601 TAGAGQGC5CC AGCOTAGGGT GGCCGGGGCT COGGGSGOOA COTCOTCCAA CATAA«300A 
12661 TOATATCOCT ACATOTACCT GGACATCCAO OTOATaCCOO OGOOOGTOOT GOACCOOOQC 
12721 OGAAAGtCCC eGACGCCGTT CCAGAIGTTG CCCAGCGGCA AAAAGTGCTC CAIGGTOOOG 
127B1 ACGCTCIGGC CGQIGA0OOG TGOSCACTCG TTGACOCTCT AOACCOTGCA AAAOOAGAGC 
12841 CTOXAAGOOG GCACTCTTCC OZQSTC3GQT GGATAAATTC OCAACGtSOSAT CATCOCSGOAC 
12901- tSACXaSCXSTT OGAACCOCOO AtOOOflCOQT aiGCCeTOAT CCA 1G0GQ TT ACOO CCOGOS 
12S61 TCTCGAACCC AGGTOTGOGA CGTCACACAA OGOGOCaOOG CICCTTTIGO CTIOCTTCCA 
13021 GGCOCGGCGG CTOCTGOOCT AOCTTTTTTG GCCACTOGCC OCOOGOOGCG TAAGCQ GTTA 
13081 OGCTOGAAMS COAAAGCAOT AAOTCGCTCC CTOOCTOTAO eOGGAOeCTT ATTTTCCAAC 
13141 ^GTTCAerOS CAGOACOCCC GGTTCGAeTC TOflGGCCGGC CGGACTGCGG CXSAACGGGGG 
13201 TTTGCCTCCC CGTCMGCMi eACCCOGCTT GCAAATICCT CCOOAAACAS GSAOOAGCCC 
13261 GTTXaTTOCT TTXCCCAGAT GCATCOGGTQ CTCCGOCAOA TCCCCCCCOC TCCTCAOCAO 
13321 CGGCAAGAGC AAGAGCAGCG GCAGACATCC ACGGCACCCT CCCCTTCPCC TACCGOSTCA 
13381 GGAOOGCCAA CATCOGCGGC TOACOCX30CO GCACATGOTG ATTAOGAACC CCCGOOGCGC 
13441 GGGGCCCGdC ACTACdeOA CTTGGAG6AG O600AGO0OC TGGOGCGGCT AOGAOOGCCC 
13501 TCTCCTGAGC GACACCCAAS GGTGCAGCT6 AAGCGTOACA COCGOOAOGC GTACGI6CCe 
13S61 CGGCAGAACC TCTTICGCGA CCOCOAGGGA GAGGAGCOCO AOQAGATGOS OGATCGAAAC 
13621 iWCCACGCAO GGCGCOAGTT GOGGCATGGC CTSAACCGCG AGCGQTTGCT GCGCGAOGAfi 
13«81 GACTITOAflC CCGACG0GC6 GACCeOGATT AGTCCOOOGC GCGCACACOT GGCtaSCCGOC 
13741 OACCTOGTAA CCOCOXACGA CCAOACOOTG AACCAOGAGA TTAACTITCA AAAAAGCTTO 
13801 AACAACCAOG TCCGCACGCT TOTOOOGCGC CAGGAiOeTGO CTATAG6ACT GATGCATCIG 
13861 OGGGACraro TAAGCGCCCT GGAGCAAAAC CCAAATAGCA ASCCOCTOOT OOOGCAGCTC 
13921 •raCCCTATAG 1CCAGCACAO CAGOGACAAC GAGOCATTCA OOGATGCOCT CCTAAACATA 
£3981 OXAGA^ jS^OGCTO GCWCWQAT TIOATAAACA TTCTGCAGAG CATAGTOGTG 
14041. CAGQAGCGCA GCT2CAOCCT GGCTQACAAC CTOGCOGCCA TOAACTA^ SISfJSS 
14101 CTCGGCAAGT TTTACGCCCG CAAGATATAC CATACCCCTT AOGTTCCCM AGACAAeOAG 
14161 GTAAAGATCG AGGGGTTCTA CATCCGCATG GOGTTGAAaO TCCTTACCTO GAGCGACGAC 
14221 CTOGGCQITT A'TCGCAACGA CCOCATCCAC AAGGCCGTGA GCGTGAGCCG GCGOCGCGAG 
14281 CTCAflCGACC GCGAGCTCAT GCACAGCCTG CAAAGOOCCC TGGCTGGCAC GGGCAGCGCC 
14341 GATAGAGAGG CCGAGTCCTA CTTPGAOGCG GGCGCTGACC TGCGCTCGCC C^J^^J 
14401 CGCGCCC3XSG AGGCAGCIGG GGCCGGACCT GGOCTGGCOG TGGCACCCOC CCGCGCTGGC 
14461 AAOGTCGGCG GCGTOGAOGA ATATGACGAG OA0GAT6ACT ACGAOCCAGA OGACGGCGAO 
14S21 TACTAAGCGG TCATCTTTCT GATCAGATGA TCCAAGACX3C AACGGAOCOG GCGGTGCGGG 
14S81 OGGCGCTGCA 6AGCCAGCCG TCCGGCCTTA ACTCCACGGA CGACTG^GC ^GGTCWTOG 
14641 ACCGCATCAT GICGCTOACT OeGOOTAACC CTCACGCCTT CCGGCAGCAG CCGCAGGCCA 
• 14701 ACCGGCTCTC CGCAATTCTO 6AAGCGGTGG TCCC0GCOCX3 CGCAAACCCC AC^OGJ^ 
14761 AGGTCCTCGC 6ATCGTAAAC GCGCTQGCCO AAAACACGGC CATCCOOC^ «ATGA0OTOG 
14821 GCCI^GICTA CGAOGCGCIG CTTCAGCGCG TCGCTCGTTA ^ACAGCGGC AAC^WCMA 
14881 dcAACCTGGA CCGGCTCGTO GGGGATGTGC GCGAGGCCGT GGOGCAGCGT GAGCGOGCGC 
14941 AGCAGCAGGG CAACCTCOOC ICCATGCrrG CACTAAACGC CTTCCTGAGT ACACAGCCCG 
ISOOl CCAACGTCCC GCGGGGACAG GAGGACTACA CCAACmGT GAGOGCACTO CGGCTAATCG 
1S061 TCACTGAGAC ACCGCAAAGT GAGGTGTACC AGTCCGGGCC AGACTATTTT JT^JSA^ 
15121 GTAGACAAGG CCTCCAGACC GTAAACCTCA GCCAGGCTTT CAAGAACTTG CAGGGGCTCT 
15181 GGGGGGTCC6 GGCTCCCACA GGCGACCGOG CGACCGTCTC TAGCTIGCTG ACGCCCAACT 
15241 CGCGCCTCOT GCIGCTCCTA ATAGCGCCCT TCACGGACAG TGGCAGCGTG TCCCGGGACA 
15301 St^cSgg TCACTTOCTO ACACTGTACC GCGAGGCCAT AGGTCAGGCG CATGTGGACG 
mil SSS?? ?SSSS? ACAAGTOTCA GCCGCGOGCT SS^^ 
15421 GCCTCGAGGC AACCCTCAAC TACCTGCTGA CCAACCGGCO GCAGAAGATC CCCTCGTTGC 
1S481 ACAGTPTAAA CAGCGAGGAG GAGCGCATCT TCCGCTATCST GCAGCAGAGC GTGAGCCTTA 
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Nucleofide Sequence Analyds (conL) 

1S541 ACCTCAJtOCO CGAOOOGGTA ACGCCCASOG TGGCGCTGOA CATGACCGCG OGCAACATCG 
1S601 AACCGGGCAT GTATOCCICA AACCGGCCGT TTATCAATCG CCTAATQGAC TACTTSCATC 
.15661 GOGCGGCCGC CGTOAACOOC OAMATTTCA CCAATOCCAT CTT«ACOOC CACTCOCTAC 
1S721 CGCCCCXJTOG TTTCTACACC GGGGGATTTG AGGTCCCCeA COSTAACGAT CfiRTTCCTCT 
3:5761 XXSGAOQACAT AGAOOACAGC CPK5TTTTCCC CCCAACCGCA GACCCTOCTA GAOTTSCAAC 
15841 ABOGOOACSCA CGCAQMGOO GOfiClGOOAA AGGAWiOCTT CCGCAGGCCA AGCAGCTTGT 
15901 CCGATCTACSO COCTOCOGCC CCOCOOTCAfi ATCCGMW5AS CCCAOTTOCA AfiCTTOMSftO 
15»61 GGTCTTTTAC CAGCACTOGC ACCACCOGCC CeOCCCICCT COGOWOOftS CACTACCTAA 
16021 ACAACTOOCT OCTOCAGCXX3 CAGCGCOAAA AGAACCTCCC TCCOOCATTT CCCAACAAOG 
1«081 GSATAGASftG CCTASOXWAC AAGATCAGXA GATOSAAOAC GTATOCOCAG GAGCACAQGG 
16141 AIOIC C CCCO CCCCCeceCG CCCACOOOOJC OTCAAAGGCA COACCGTCAG CGOSGTCTfiG 
16201 TCTCGSAGGA CGAWACTCG CCWSAOOACA GCAGCQTCCT OOATWOCCA GGOACXOGCA 
16261 ACCCGTITOC GCACCTTCGC CCCAGGCTSG <5GAGAATCTT TTAAAAAAAA AAAAAAAAAG 

iSS StsSgSa AATAAAAAAC TCACCAAGGC catggcacog Jgogttggtt ttcttotwt 

16381 CCCCWACTA TCCACOOCGC OGCOATCTAT GAflGAACCTC CTOCTCCCTC CTACGAGJGC 
16441 GTCGIOAOOO COOOGOCAOT GGCGGOOGCG CTOOCSTTCOC ^J^JTCC TCCCCTflG^ 
16S01 COGCCGTTTG OGCCTCCGCC GTACCTGOGG CCTACCOOGO OOAOAAftCftO CATCOOTTAC 
wlei ^SSSS CACCCCTATT CGACACCACC CGTCTGTACC JT^G^ SJSSJS 
16621 <SATO«SGCAT CCCTOAACTA CCftfiAACGAC CACAGCAACT TTCTAJCCAC OCTCMTCftA 
16681 AACAMGACT ACAGCCCGGO GGAGOCAMC ACACAGACCA TCAATCTOGA CGACOGTTOG 
16741 CACTOOGGOG GCGACCIGAA AACCATOCTO CATACCAACA TOCCAAATCT GAACGAGCTC 
16801 ATCTTTACCA ATAAGTITAA OO0G0QGGT6 ATCOTGTOGC ^TCGOTHlC MAGJ^AJA 
16861 CAGGTCOAGC TGAAATATCA GTOGGTCGAG ITCACGCTOC CCGAGOQCAA CJAOPMG^ 
16321 ACCATGACCA TAOACCTTAT GAACAACGOG ATCGTGGAGC ACTACTTGAA ^TCOGCAOG 
16S81 CACAACGGGG iraCOOGAAAG CGACA1CGGG GTAAAGTTTO ACACCCGCAA CTTCAGACTO 
17041 GOGTTTGACC CAGl«ACTaG TCOTGTCATG CCTOGGGTAT ATACAAACGA AOOTTCCAT 
17101 CCAGACATCA TOTTCCTCCC AGGATOCGOG CTCGACTTCA CCCAC^CO CCIGACCAAC 
17161 TIGTTOGGCA TCCGCAAGCO GCAACCCTTC CASGAOGOCT TTAGGAICAC CTACGATCAC 
17221 CTCGAGGGTG GTAACATTCC CGCACTGTTG CATGTGGACG CCTACCAGGC AAGCTTAAAA 
17281 GATGACACOG AACAOCGOGO GGATGGCGCA GOCGGCCCCA ACAACAGTGG CAGCGGCGTO 
17341 GAAGAGAACT CCAAOGCOGC AOCOGOGGCA ATGCAGCOOG TCGMGACAT GA^ATCAT 
17401 GCCATICGCG GCQACACCTT TOCCACACGG C0GGAG6AGA AGCGCGCTGA OOCCGAGGCA 
17461 CCGGCAGAAG CTGCCGCCCC COCTGOGCAA CCCGAGGTOO AGAAOCCTCA 6AAGAAACCG 
17S21 GTGATCAAAC CCCTCACAGA GGACAGCAAG AAACGCAGTT ACAACCTAAT AAGCAATGAC 
17581 AGCACCTTCA CCCAGTACCG CAGCTOGTAC CTTGCATACA ACTACGGOGA CCCTCAGACC 
17641 GGGATCCGCT CATCGACCCT CCTTTOCACT CCTCACGTAA CCTG^GCTC «J^CAOCTC 
17701 TACTCCTOST TCCCAGACAT GATGCAAGAC CCCOTCACCT ^^OGCTCCAC ©JJCCAGATC 
17761 AGCAACmC CGGTCOTOOG CGCCGAGCTG TOGCCCCTGC ACTCCAAOAG «JCJJCAAC 
17821 GACCAGGCCG TCTACTCCCA GCICATCCGC CACTTTACCT CTCTSACCCA OGTOTTCA^ 
17881 CGCTTTCCCG AGAACCACAT TITCGCGCGC CCGCCACCCC CCACCATCAC JACCGTC^ 
17941 bAAAACGTrc CTCCWICAC AGATCACOGG ACGCTACOGC TCC^CAO CATCCG^A 
18001 GTCCAGCGAG TCACCATOAC TGACOCCAGA CGCCGCACCT GCCCCTACGT TTACAAQGCC 
18061 CTCGGCATAG TCTCGCCGCG CGTCCTATCC AGCCOCACTT OTTOAGCAAA CATGTCCATC 
18121 CTTATATCGC CCAGCAATAA CACAGGCTGG CGCCTGCGCT TCCCAAGCAA GATGTTOGGC 

. mil cSSSISJ SSSJSa ccaacaccca giccoccocc ocgggcxcta cce^Gccc 
18241 TCGOGCGOGC ACA>»CG60G cogcactggg cgcaccaccg tcgatgacgc cattgacgcg 
Itltl aSJgScS ctacaogccc acgcc«ccac cagtgtccac agtcgacgcg 

mil SStTCaS cStGGtSg COGAGCCCX3G CGTTA1GCTA AAATGAAGAG ACGGCGGAGG 

mil cgcgtSSc gSgccaccg ccgccgaccc ggcactccco cccaacgcgc ggcggcggcc 

mil SSJS^ ^SSS^S CACCOOCCGA CGGGCGGCCA TGCGGGOCGC TCGAACGCTG 
18S41 G^COOOTA TTCTCACTCT GCCCCCCAGG TCCAGGCGAC OAGCGGCCGC CGCAGCAGCC 
18601 Sgcc^a GTCCTATCAC TCAGGGTCGC AGGGGCAACG TCTACTCGGT GCGCGACTCG 
mil SgSJc CGTGCGCACC CGOCCCCgC OCAAC^T ^J^JAGAAAA 

18791 JUVCTACTTAG ACTCGTACTC TTGTATCTAT CCAGCGGCGG COOCGCGCAA CGAAGCTATG 

^^78^ tcSJSS iSrcSSA agaga-kktk: cagcttcatcg cgcoggagat ctatggcccc 

mil SSJSSS iSS^ ^ACAAOCCC CGAAJGCTAA ^^^t 
18901 AAAGATCAT6 ATGATGATGA ACTTOACGAC GAGGTGGAAC TGCTGCACGC AACCGCGCCC 
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18961 AGC5CGGCGGG TAC?«S«SaAA XOCWCGAOSC ClAAGAOCTO TOTTCCG^ CGGCACCAOC 
19021 OTAOTTTWA 0GCCCC«5TCA CCaCJ«»CC OOOkCCIACA ^^^^^^ TCMGJGCTG 
19081 TACGGCGACG AQGAOCOtSCT TCAGCAOOCC AACQA6CGCC TOGGOGJOTT JGJ^aOO^ 
19141 ijtfSOGSCATA MORCMaXT O0CXWXGC06 CTGGACGAOG GCMCCO^ UXfOmOA 
wJSi ^T^oS GOTGCroOCC ACGCWOCAC CGTOO^^ J^^S^SSS 

W261 cSaSwS ACICKJGTOA CTOXaCCC '"^^WSS ISSSS^ SiSSSSS 
i;321 OaCKSGAAG ATSTCTTOGA AAAAATGACC OTOQMCCTO OOCTOMOCC COJOOTC^ 
19381 CTCCOGOCAA TCAAGCAGOT OGCACCOOGA CTOOGOOTGC ^JCCGTGW^ SSJS^ 
19441 OCCAOCAOCA CTASCACTAO TATICOCACT CCCACAGAGG GCMTOG^AC ^CAAJTOTCC 
19S01 CCCSOTTOeCT CCGOGGIOSC ACaiTOOOeCO OTOCMGCGG ^^^^^ S^S^Ji? 
19561 ACCTCTACOO AGGTCCAAAC eGACCOOTOG MmtCO^ J^^^ 
19621 CGOOGTTCCA GGAAGTAOGG CACCGCCAOC GCaCTACTOC COOAMMOC CCTACM^ 
19681 TCCATCGCGC CTACCCCCGG CTATOCTOGC TACACCTAOC OOOCCAOJAO MOMOaOP 
19741 ACeSOtaOGOC GAACCACCAC TO0AACCC5GC CGCOGCCOTC OCOGTCGCCA GCCCOTGCTG 
inSi 0«0GC1CGC CW^^ 

19861 GCGOGCTACC ACOCCAGCAT COTTTWAAG CCGGTCTTTO TOCTTCTTOC AGATATOTOC 
19921 CTCACCTCCC GCCTCOGCTT CCCGGTOCCG eGATTOOCAC GAAOAATCCA OCgMgggG 
19981 GGCAOTGCCO GCCACGCXXn* GACGGGCGGC ATGOGTOGTO COCJCC^ ^SSSS^ 
20041 6CG1CGCA0C OTCGCATGCS OGGCfiOTATC CTGCCCCTCC TTATTCCACT GATOGCOGOS 
iOlOl ^^TTG^ StGCCOOO AATOICATCC GTGGCCTTGC AGGCGCAGftC ACACTGMnnL 
ISifl SS^ AXATC^T AAJAJGJCTG O^JCTC^ ^SSS 

(20221 CCTOTAACTA TTWCIACAA TOGAMACAT CAACWTG06 TCTCTOG^ SSJ^^ 
20281 CTCGCGCCOG TTCMOGOIA ACTOGCAM3A tCATOGGCACX: ^^JAM^ATCA OjeGTGGOOC 
20341 CTTCAGCTOG GGCTCGCTOT GGAGCGGCAT J^JJ*^ fS^StSJ tcJ^SJ^ 
20401 TCGCASCAAG GCCTGQAACA GCAfiCACAOO OCaSM^CM JOOf^J^ J^SSSS 
20461 AAATTTCCAA CAAAW3GTOG TAGATCOCCT <!GCCTeTOaC «ra*0«0000 TOWPOOJOT 
20S21 GGOCAACCAO GCAfilGCAAA ATAAGAOTAA CAGTAMCTT ^^CC^GCC ^gS^S 

r«r,i s=? ?^ = ™ 
i;r.isisj!ss!: ^s?^ 

isn; ^ 

->n<iit-i ,->^AfTv>— TY^ ftr-WTGGQGG TOCAATCCCT CAAGCGCCGA OGATGCrrCT. aaaiauuiaa 

21061. GOOCCCCCTO TCCAAGATG6 CTACCCCTIC GATCWGOCG cSSSSS^ 
21121 CTCGGGCCAG GACGCCTCGG AGTACCTCAG CCCCOOOCTC f^^^ SSgScCA 
21181 CGM5ACGTAC TTCAGCCTCA ATAACAACTT JAJJJ^CCC JCO^JS^S^ J^SagS 
21241 CGTAACCACA GACCCGICCC AGOGTrTSAC GCTGCGCTTC JJCCCTCTOG 
21301 TACCGCGTAC TCGTACAJAG CGC™ CCJOGCTGTG J^J^^ 

Ii42i SISSS^ JSSS GCI^^ CCTGTGAGTG 

2»8i SJSSS? SSS?^ ScCGOGCAGT TCCCGAGGAT GAMAAGAGG JAGATGAAGA 

'^^^ '^^^ 

l^ll ?SS;SSS^ SJSS -SaTGCJA GCC^g^ 

2ia4i togtcgtcaa tccgttctgg ttcoooatoa J^^^J^^^ SSJtcSJ ctaaacSS 

21901 GCAAlTCnC TCAA^A OCJCTT^ ^^^A SSS^S^ SS^S^J 
21961 AGTGGTrrTG TACAGTGAAG ATCTAAAT^ StcTtSS SaCAATCTA TOCCAXACAG 
22021 ACCTGGAAAA GGTGATC5AAA ATTCTAAAGC ^ATOTTaaCT l.^ ACAGCACTCG 

22081 ACCCAATTAC A^^^C^ ^SS^IS 1^^^ cSSSJIS JS^S 
22141 CAACATGGGT GTTCTTGCTG GTCAGGCATC ^J^^J^J aTAGGTCATA GAACCAGATA 
22201 SJ^J^S ?SSS?SS JJS^S SSSaTCCA GAIGTTAGAA TCATIGAAAA 
22321 SSSI^S S^JSS SSJStTA nGTrrrCCT CTTGOGGGTA rrrGGOGTAAC 
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Nttdeotide Sequence Analyds (cont) 

-i-y'iM n«ACAOCT^T CAW3CTATOA AGGCTAATGG CAMGGCTCA GGCGATAATS GAGMMjCTAC 
ilJS S^SSI S^SS JtoSaCXCC TAMGAAATA ACAMTTTSC 
^iSJ SSSJ^ SSwSSc CcScXAW eWSAAATTTC CPWACTCCA MATTO0C5CT 
§^6^ SJSJ^ C^CA^ 

22621 CACCTftCGAC TACATCAACA AGCGAOTOCT GOfTC^ ^^JSJS SScOOOO 
•22681 CCaTOGOGOG OGCOXSOTCTC TGC^CIACMC ^GJ^J^J SSSSSo JoOCCTTOA 

sTsJi SSSS S^^SSSJ^ SSSSJS? 

mil Satot sssj ssssjj 

i2921 CGA1CTTW5A GnOACOCCC CTAfiaMW^JJ SJSS 

.22981 cnCCCCATG GCCCACAACA CeOCCTCC^ ^SSJSS ^S^O^ TAOCOMCAA 
23041 CGACCAOTCC •TTTAATSACT ACCTTTO^ SSSJSS S^^Sw OCGGTTQGGC 
23101 C6CCACCAAC GIOCCCATCT «ATCC^TC JJ^SJ^ SSSSSo ACCcSSS 
23161 CTTCACACGC TTOAAGACAA AGGAAACCCC ^njCCCTOTOA ^^^^^^-^^ ACACCOTTAA 
93391 C3LCCTACTCT GtSCTOCATAC CATACCTIGA OOQAACClTC TATOTOATC ACMCTOTAA 
23221 CACCTAw-iv* »<wvnv?«W5T -EAGCTGOCOG OOCAAOOAOC OCCTCCTTAC 

23281 GAASGWSGCX: AOTACCTllG ^C"™??* fiGCraCAACX; TAGCICASTO 

23341 OCCCAATCaO rTOSAGATTA AACGCTCACT ^?»5S2S2 S^CAMA TXGGCTOcS 
23401 CAACAIGACC AAGGACTGGT «CTGGTGCA CMGTTO^ J^g^JS 
23461 GOOCIOCPAC ATTCCAGA^ SSSJSJ JSS^^^ SSSSoC AOOTTGOAAT 

SIS ^c^cc^ -c^- 

^. ^sssss s^s^ 

g J2 SSSSS ^cjjc 

23881 OTTTCAAGTC ™!JCOJQG TCOGJG^ SSS^ 1^^^ 

23941 CrTACCTGCCC ACOCCCTTCT CGGCOSGCAA COCCACAACA ICAAAOATCT 
24001 AACAACASCT GCCOCCATOO OCTCCACTCA OCAGGW^ ^J^S^S TTGTTTctS 
24061 OGGrrGTOGG CCATATTTTT «SG«CAC«A ^GA^^ SSSSS SSSS^S 
24121 ACACAAGCTC GCCTGCCCCA TAGTCAAW^ f^SSS ScTToSoC CCTTTOGOTT 
ji4181 GAaXJOCOTTT GCCTWSAACX: CGCGCTCAAA ^ACJJPOCTAC TCCGCCCTAG 
24241 TTCIGACCAA OGACXCAAGC AGGTTTACCA GTCTGACTAC J^J^^ SScGTCCA 
24301 CGCCATIGCT TCTTCCCCCG JCOGCTGTAT JJ^Sg 
24361 GGGGCCCAAC TOGOCCGCCT GTCSGACTATT CTGCTGCMG ^J* GGGTACCCAA 
24421 CTC5GCCCCAA ACTCCCATGG ATCACAACCC CA^MSAAC ^TWraftTOG 
24481 CICCAOCCTT AACAGTCCCC JGGT^GCC CACOCTGOCT OOC^CAGG JACAGCTCTA 
24541 CAGCTTCCTC GAGCOCCACT CGCCCTJCTT JcSSSaC ACTOTCAATA 

24601 CACnCTTTT TGICACTTGA JJAACA^ TAO«cSaC CCTIGCCGTC 

24661 AAQGCAAATG TmTATTTG TACACTCTCO ^S^SotC iATGCGC£3lC TOGCAGGGAC 
24721 IGCGCCGTW AAAAATOJAA GGGGWCTGC ^^J^ SISaCCAT CCCOOGCAGC 
24781 ACGTTGCGAT ACTOC?TCTTT AOTGCTCCAC TTAMCTCftS JJ^^^ CAGCTCGGGC 
24841 TCGGTOAAGT rTTCACTCCA CAGJCTCCGC JJgJ^ ^?^SaGTT GCGATACACA 
24901 GCCGATATCT TGAAGTCGCA CJTGGGGCCT ^OOCCCTGOT CACGCTCTO 
• 24961 GOGTTGCAGC ACTCGAACJJ TATCAG^ 00^^ 

25021 TO3GAGATCA CATCCGCGTC CAGGTCCTCC OOJTT^K^ AGKCCACTC GCACCOTAGT 

25081 ggtac?:tccc ttcccaaaaa og^SSS SS?SSS ^SSSctg CATGAAAGCC 

25141 GGCATCAGAA <»TG^0GTG CCCGOTCTGG <^^^ AGAAGAACAT GCCGCAAGAC 
25201 TTGATCTGCT TAAAAGCCAC CTOAOCCTTT CXSCAOCACCT 1X5CGTCGGTG 

25261 TOGCCGGAAA ACTCATIGGC CGGACAOGCC GCGTCMGCA ^JJ^^ COTCCTAGAC 
25321 TIGGAGATCT GCACCACATT TOGGCCCCAC CQOTTCTTCA ^.^^^ CACGTGCICC 
25381 IXSCTCCrrCA GCGCGCGCTC CCCGTTTTCG CrCCTCftCAT AGCGCAOCGG 
25441 rrATOTATCA TAATGCTC^ SS^cSg SgTTACCTC TGCAAACGAC 

25501 1GCAGCCACA ACGCGCAGCC OGTCGGCTCG TGOTGCTTCU ^^^^ gcTCGTGAAG 

25561 ogcaggtacg cc««:aggja JJ^JJ^ JS^SSS?? JSSSS CGCCAGAGCT 

2S621 GTCAGCTGCA ACCCGCGGTG CTCCTCGT^ ^^^^^ CGTTAaXICAC GTOGTACTTG 
25681 TCCACITOGT CAGGCAGTAG SZZSJJSTJ! ?SScACO CAGACACGAT CGGCAGGCTC 
25741 TCCATCAACG CGCGCGCAGC CTCCATGCCC TTCTCCCAca ^./v 
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Nucleotide Sequence Analysis (oont) 

2S801 AOOOOCTTTA TCACCCTCCT TTCACTTTCC GCOTCACTOQ ACTCTTCCTT TOCCTCWCC 
2S861 GTCCGCATAC CXXXXXSCX^AC TCGGTCGXCT TCATTCAfiOC GCCGC ACCGT GOOCTTACCT 
25921 CCCTTCOCGT GCTTCATTAG CACCGOTGGG TTGCTCAAAC CCAOCATTTG TAGCCCCACA 
25981 TCTICItiyr rr CTICCTOGCT GICCAOCATC ACCTCTOOGG ATOGOGGGC6 ^I^DQGGCTIC 
26041 GGAGAGGOGC GCTTCTTTIT C TTTTTSQAC GCAATOGCCA AATCOOCOGT CGAOGTOfiAT 
2^101 GGCCGOGC5GC TOCSGOCTGOO COGCACCAOC QCATCTIOTO ACCAfiTCTTC TICOTCCTOG 
26161 GACTCCAGAC GCCX5CCTCAG CCGCTTTTTT GGGGGCGCGC G0GGAGGCG6 CGGOCAOGGC 
26221 CAOGGGGAOQ ACACGTCCTC CATaGTllGGT OGACGTOGOG COGCAOCCCQ TCCGOGCTOG 
26281 CGGGTCOTTT OGCGCTGCTC CTCTJCCO G A CTGGCCATTT CCnCTCCTA TAiSGGAGAAA 
26341 AAGATCA!rGG AGICAGTCGA GAAGGAGGAC AOCCT AAOQ g COOCCTTIQA. C TTO GQC ACC 
26401 ACOGCCTCCA C03ATGCCGC CAAOGOGCCT ACCACCTTCC COGICGAGGC ACCOCOGCTT 
26461 GAGGAGGAOG AAGTCATTAT OGAGCAOGAC CCAGGTTTTO TAAGCCAAGA OOAOGAGGAT 
26521 OGCTCAOTAC CAAGAGAGGA TAAAAJ^GCAA GACCAGGACG ACGGAGAGGC AAAOGAGGAA 
26581 CAAGIOOGGC GOGGGGACCA AAGCeATOGC. GACTACCTAO ATCTOGGAGA OGAOCTOCOXS 
;2^^6641 ITGAAGCATC TOCAGCGCCA GTGOGCCATT ATCTOOGACG CGTTGGA AGA GCGCASOGAT 
26701 CTGCCCCTCG CCATAGCGGA TGICAGCCTT GCCTAOGAAC 6CCACCTGTT CTCACCGOGC 
26761 GTACCCCCCA AACGCCAAGA AAAOGGCACA TCCGAGCCCA ACCCGCGCCT CAACTTCTAC 
26821 CCOGTA1TTO CCGTOCCAOA GGTGCTOGCC ACCTATCACA TCTTTTTCCA AAACTGCAAG 
268&1 ATACCCCTAT GCOXSOOGTOC CAACCOCASC CGASCGQACA AGCACCTOGC CWOCOGCAC 
26941 GGCGCTGTCA TAOCTCATAT OGCCTCGCTC GACQAAGTGC CAAAAATCTT TCAGGGTCTT 
270.01 GGAOGCOACO AGAAAOGGGC GGCAAACGCT CTCCAACAAG AAAACAGOGA AAATOAAACT 
27061 CACOXSTO GA O TtSCTCGTOGA ACTTOAGGGT GACAAOGCGC GCCTAGCOGl* OCTGAAACGC 
27121 AGCATCGAOG TCACOCACTT TGCCTACCCG GCACTIAACC TACOCCOCAA OGTTAICACC 
.27181 ACAGTCATCA GCGAOCTCAT CGTOOGOCGT GCACCACOCC TOGAGAGGOA TGCAA MTTC 
27241 CAAGAACAAA CCGAGGAGGG CCTACCCGCA GTTCGCGATC AGCAtSCTOGC GOGCTCQ CTT 
27301 CAGAOGCGGG AGCCTOCCGA CTTGGAGGAG OGAOGCAAGC TAATCATGGC CGCAGTCCTT 
27361 GTTAcbGOXSO AGCTTOAGTC CATOCAGCGG TTCTITCCTG ACCCQGAGAT CCAGOGCAAG 
27421 CTAGAGGAAA GGTTOCACTA CACCTTTCGC CAGGGCTAOG OXSCOCCAGGC CIGCAAAATT 
27481 TOCAACGTOG AGCTCTOCAA CCTGGTCTCC TACCTOCGAA TTTTGCACGA AAACCCCCTC 
27541 GGGCAAAACG TCCTTCATTC CACGCTCAAG GGCGAGGCOC GCCGOGACTA OGTOOGOGAC 
27601 TCOGITXACT TATTTCTCTG CTACACCTGG CAAACGGCCA TGGGCGTGTO GCAGCAATCC 
27661 CTCGAGGAGC GCAACCTAAA GGAGCTCCAG AAGCTCCTAA AGCAAAACIT GAAQQACCTA 
27721 TOGACGGCCT TCAACGAGCG CTCCX5TGGCC GCGCACCTOG CGGACATTAT CTTCCCOGAA 
27781 CGCCTGCTTA AAACCCTGCA ACAGGGTCTG CCAGACTTCA CCAGTCAAAC CAyCTIGCAA 
27841 AACTOTAGGA ACTOTATCCT ACAGCOTTCA GGAATTCTGC CCGCCACCTC CPGTCCGCTT 
27901 CCTAGCGACT ITGTGCCCAT TAAGTACCGT GAATGCCCTC CGCCGCTTTG GGGTCACTOC 
27961 TACCTTCTCC AGCTAGCCAA CTACCTTCCC TACCACTCCG ACATCATGGA AGACGTGAGC 
28021 GGTGACGGCC TACTCGAGTC TCACTOTCGC TGCAACCTAT GCAC CCCGC A CCGCTCCCTG 
28081 CTCIX3CAATT CGCAACTCCT TAGCGAAAGT CAAATTATOG GTACCTTTOA GCTOCAGGCT 
28141 CCCTCGCCTC ACGAAAAGTC CGCGGCTCCG GGGTTCAAAC TCACTCCGGG GCTCTCGACG 
28201 TOGGCTTACC TTCGCAAATT TGTACCTGAG GACTACCACG CCCACGAGAT TAGGTTCTAC 
28261 GAAGACCAAT CCCGCCCGCC AAATOCGGAG CTTACCGCCT GOGTCATTAC OCAGGGCCAC 
28321 ATCCTTCGCC AATTOCAAGC CATCAACAAA GCCCGCCAAG AGTTTCTGCT ACGAAAOGGA 
28381 CGGGGGGTTT ACCTCGACCC CCAGTCCGGC GAOGAQCTCA ACCCAATCCC CCGGCGGCCG 
• 28441 CAGCCCTATC AGCAGCCGCG OGCCCTTGCT TCCCAGGATG GCACCCAAAA AGAAGCTGCA 
28501 GCTCCCGCOG CCGC6ACCCA CGGACGAGGA GGAATACTOG GACAGTCAGG CAGAGGAGGT 
28561 TTTGGACGAG GAGGACGAQA TGATGGAAGA CTCGGACAGC CTAGACGAAG CTTCCGAGGC 
28621 CGAAGAGGTG TCAGACGAAA CACCGTCACC CTCGGTCGCA TTCCCCTCOC OGGCGCCCCA 
28681 GAAATTCGCA ACCXSITPCCCA GCATOGCTAC AACCTCC3GCT CCTCAOGCGC OGCCGGCACT 
28741 GCCTCITCGC CGACCCAACC GTAGATGGGA CACCACTOGA ACCAGGGCCG CTAAGTCTAA 
28801 GCAGCCGCCG CCGTTAGCCC AAGAGCAACA ACAGCGqCAA OGCTACCGCT CGTCGCGCGG 
28861 GCACAAGAAC GCCATAGTTC CTTOCTTGCA AGACTCTGGG GGCAACATCT CCTTCGCCCG 
28921 CCGCTTTCTT CTCTACCATC ACGGCGTGOC CTTCCCCCGT AACATCCTGC ATTACTACCG 
28981 TCATCTCTAC AGCCCCTACT GCACCGGCGG CACCGGCAGC OGCAGCAACA GCAGCGGTCA 
29041 CACAGAAGCA AAGGCGACCG GATAGCAAGA CTCTCACAAA GCCCAAOAAA TCCACAGCGG 
29101 CGGCAGCAGC AGGAGGAGGA GCGCTGCGTC TGGCGCCCAA CGAACCCGTA TCGACCCGCG 
29161 AGCTTAGAAA TAGGArTTTT CCCACTCTCT ATGCTATATT TCAACAAAGC AGGGGCCAAG 
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Nucleotide Sequence Analysis (cent.) 



29221 AACWU»GCT GAWATAAAA AACAG<5TCTC TGCGCTCCCT <ACCOOOJOC ^?OCWATC 
29281 ACAAAAGCGA AGMCJuSCTT CtSOOGCAOSC TOOAAGAOCSC CG7U3GCTCTC OTXakCCMAT 
29341 ACTCOGCGCT GACTCTTA^S GACOSkCTXTC GOGCOCXTTC TCWATTOJA ^CGAJAAC 
29401 TACGTCATCT CCAGCGC5CCA CACCOCSaoaC OlGCACCTOT CCTCWSOflCC ^ATOMCA 
29461 AeaAAAT3X:C CACOCCCTAC ATOTflOMTT ACCACCCACA AATGGGACOT GOOGCTSajG 
29521 CTCCCCAACA CXACTCAACC CGAATAAACT ACAIXSAfiOGC COGACCCCAC ATOftmiTaCC 
•29581 OaSTCAAOOC AATCOGOeOC CACOOWACC GAATTCTCCT CQAACMGCG GCTATTACCA 
29641 CCACACCICO TAATAACCOT AMCCCCGIA OOTOOCCCOC aCCCCTOOlO ISiCCAGGAAA 
29701 trrCCCGCTCC CACCACTCTO CTACTTCCCA GASAOOCCCA GGCCGAAaiT CACATSACTA 
29761 ACTCAGGGGC GCAGCTTOCO GGCGGCTTTC GTCACASGCT GCGGTOGCCC OGOCMGOSA 
29811 TAACTCACCT CAAAATCASA GGGOGACGTA rrCAfiCTCAA OQAOGACTOO OTBAOCTOeT 
29881 CTCTTOGTCT CCOTCCGGAC GGGACATPrC AfiA«»OCGG OSCT^C^ JCTTCATTOA 
29941 CGCCCCGTCA GGOCSATCCTA ACOCTGCAOA CCTCGTCCTC O«CC0GCGC TCO SGAGGCA 
30001 TIOGAACTCT ACAATTTAtT GAOGAOTTOG «JOCTTCGGT J^CTTCMC ^CTTTTCTG 
30061 CACCTCCOGG CCACTACCCO OACCAGmA TTCCCAACTT TCJjeOOCTG AAAQACTCOG 
30121 COGAOOOCTA OGACTOAATC ACCAGTGGAG AGGCAfiAGCG ACTGCCCCPS ACACACCTOG 

30181 aSactgccg ccgccacaag tcctttoccc gcgoctccog tgagtttt^ ^iS^iS 

30241 TCOCCGAAGA GCATA1XXSAO CGCCCOGCGC ACGGCGTCCG GCTCACCACC C^MMC 
30301 TTACACXSPAG CCTCATICGG GAGTTTACCA AGOOCCCCCT GCTAGTOGAG OGGGAGOOOG 
mil ^^^^ ^JSSS GnOGCAACT CTCCTAACCC JCGJ^CAT CA^TCTTT 
30421 GTTGICATCT CTCTGCTCAO TATAATAAAT ACAGAAATTA GAATCTACTG GGJCTCCTOT 
30481 CcSScCTG TCAAOGCCAC CGrTTTCACC CACCCAAAGC ACACCAAAGG JJA^^AOC 
30S41 TCOOGWroC ACAAGCGGGC CAATAAGTAC crTACCTCGT ACTTTAAOOG CT^C^ 
30601 GTAATTTACA ACAGTPTCCA GCOAGACGAA CTAAOTTTGC CAC^CGT JWTCOOCTTC 
30661 AACTACACOG TCAAGAAAAA CACCACCACC ACCACCCTCC TCACCTCCCG GGAACGTACG 

GCCCACACCT ACAGCCTGAG CGTAACCAGA CATTACTCQC 
30781 ATirrtCCAA AACAGGAGOT OAGCTCAACT CCCOGAACTC AGCTCAAAAA JJCA^TCC 
30841 OGOGTCCTCG GATTTTOTAA TTAAGTATAT OfCAMTCA JS^JJ^^ SSSSSSS 
30901 TAATTOTCT GGAATTOOGG -PCGGGGTTAT CCTTACICTT GTAMTCTOT ™JJ°^^ 
30961 ACTAGCACTT CTCTCCCTTA GGGTTCCCGC CTGCTGCACG CACGTCTOTA ^TATTCTCA 
31021 OaPPrTTAAA CGCTCGGGGC AACATCCAAG ATGAGGTACA TOATTTTAGG CTOGCTCGCC 
iiSsi cSgOGGCAO TCTCCAGCGC a«CCAAAAAG GTTGAGTTrA AGGAACCAGC ^GCAATCTT 
31141 ACATOTAAAT CAGAAGCTAA TCAATCCACT ACTCTTATAA ^^^^2^^ JSSSSSJ 
31201 AAGCTTArrA TTCGCCACAA AGACAAAAOT GOCAAGTATO CTGTATATGC TATTI GGCAG 
3126^ ^^Zk TAAICICACA Gl^CAAG G^GAAAATCG TAAAACTTW 

31321 .ATCTATAAAT TTCCATTCTA TOAAATGTGC GATATTACCA T STACATGAG CAAACAGTAC 

3l38i SgtSSc CCCCACAAAA GTCTTTAGAG aacactggca ccttttcttc caccgctctg 

31441. CTTATTACAG CGCn^TTW GGTATGTACC TTACWTATC JCAAATACAA J^AOGC 
31S01 ACTTrrATTG ATGAAAAGAA AATCCCTTSA irrrTCCGCTT GCTTOTATTC CCCOXSGACAA 
mil S^S^AOG C«:CACGCGG GCAAGATO^T JCCCACAA^ JZgSSSS 

31621 ACTTTCCTOG ACGTTAGOGC CTOATTTCTG CCAGCGCCTG CACTOCAAAT TTGATCAAAC 
3^68^ SIgSSS SSSJlGCT CCAfiAGAlGA CCGOOtCAAC CATCGCGCCC 
31741 ATCGCAACAC CACICCTACC GGACTAACAT CTCCCCTAAA TTTACgCAA ^TCATOCCT 
31801 rrtSTCAATCA CTCGGCGAGC TTGGACATGT GGTCGTTTTC CATAGCGCTT MGWTCTTT 
a 1661 gcStatoat tatgtcgott atttcttgcc taaagcgcag ACGCGCCAGA CCCCCCATCT 
3^921 SS^A? SJSSS? AACCCACACA ATGAAAAAAT ^J^JJ^ ^SSa 
aiQfil AACCATGTTC TCrrCTTTTA CAGTATGATT AAATGAGACA TQATTCCTCG AGTrCTTATA 
iloll ttgttgSct TTTCIGTCCG TCCTCTACAT TGGCCGCGGT CGCTCACATC 

32?;^ SS^S SSSSS ^ZACAGTP TACCTGCTTT JOGGA^ Sg^S 
32161 CTCATCTOCA GCCTCCTCAC TGTAGTCATC GCCTTCArTC AOTTCATTGA CTGGGTTTCT 

32221 S^SSSS SS^SSg GCACCAOC^ ^^^^ JSSSSJ? 
32281 CTCAGAArrC TTTAATTATG AAACGGAGTG TCATTrTTGT OTTOCTCATT TTTTGCGCCC 
t^tlT TAeOTCTGCT 7TGCTCCCAA ACCTCAOCGC CTCCCAAAAG ACATATTTCC TGCAGATTCA 
SSSSS JSSSSc AGCTCCTACA ACAAACAGAG CGATTTGTCA GAAGCCTGGT 

HaII SS?^^ ?^?SSg?c ATGGTTrrrr gcagtaccat ttttgcccta gccatatatc 

Hill S?ic?SSA CaSScTCG AA1X5CCATAG ATGCCATCAA CCACCCTACT TTCCCAGTGC 
32521 CATACCTTGA CATTGGCi\^ ^^CogtTATTC CCCCAATCAA TCAGCCTCGC CCCCCTTCTC 
32581 CCGCTGTCAT ACCACTGCAA CAGGTTAi i\- t.t.wv_~» 
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Nudeotide Sequence Analysis (cont.) 

32641 CCACOOCXAC lOMATPAGC TACTTTAATP IXSACAOCIGfi AGATOACTOA ATCTCTAGAT 
3270a CTAfiAATTOO ATOGAATiaA CaiCCOAACAC (MCCTACTAfi AAAGGCOCAA GCCOfiOCSTCC 
32761 GW3CGAGAAC GCCOSAAAACA AGAACTICAX CACKPGCTTA ACCTACAOCA GTOmVAAAGA 
328?1 OCSTATCTTTT GTOTOGTCAA CCA0GCCA2A CTTACCTAOS AAAAWiCCAC TACOOGCAAC 

sizaai cgcctcagct acaagctacc cacccascgc caaaaactgg tocttatoct gogagaaaaa 

32d41 CCTATCACCG TCACCCAGCA COXXSOCACSAA ACAGAGGQCT GCCTCCACTT OCCCT ATCAG 
33001 OQTCCASAQG ACCTCTCCAC WTTATIAAA ACCATCTOOC CTATTAGAeA TCTTATTCCA 
3306i TTCAACTAAC ATAAACACAC AATAAATXAC TTACTTAAAA TCAGTCAGCA AATCTTTOTC 
33121 CAGCTTATTC AGCATCACCT CCTTTCCTTC CTCCCAACTC VGOTXTCTCX OCCOCC CTTT. 
33181 AOCTOCAAAC TTTCTCCAAA GTT13AATQG CATGTCAAAT TCCTCATCTT CTTGTCCCTC 
33241 CGCACCCACr ATCTTCATAT TOTOXSCASAT CAAACGCttCC AGACCGTCTC AACACACCTT 
33301 CAAOOOOQTO TATCCATATO ACACAOAAAC CGGGCCTCCX ACTGTGCCCT TTCTXACCCC 
33361 TCCArnXSTT TCACCCAATC GTTTCCAAOA MGfSCCCCCfS GGAGTICTCT CTCTAOCOCT 
33421 CTCCGSAACCT TOCGACACCT CCCAOOaSlT CCTTCOGCTT AAAATOCSGCA GOGGTCTXAC 
33481 CCTAGACAAG GCCGGAAACC TCACCTCCCA AAATGTAACC ACTGTTACTC AGCCACTTAA 
33S41 AAAAAqAAAG TCAAACATAA GTTTtJOACAC CTCCGCACCA CTTACAATTA CCTCAGCCCC 
33601 CCTAACAGTO OCAACCACOS CTCCTCTGAT AGTTACTAfiC CGOOCTCTrA GCGTACAGTC 
33661 ACAAGCCCCA CTGACCGTGC AAGACTCCAA ACTAAGCATT GCTACTAAAG OGCCCATIAC 
33721 AGIGTCAGAT GGAAAGCTAG CCCTCCAAAC ATCACCCCCC CTCTCTGGCA GT6ACAG0GA 
33741 CACCOTPACT GTAACTCCAT CACCCCOGCT AACTACTGCC ACGGGTAfiCT TGGGCATTAA 
33841 CATGGAAGAT CCIATTTATO TAAATAATQO AAAAATAGGA ATTAAAATAA GOGG TCCTT T 
33901 GCAAGTAiSCA CAAAACTOOO ATACACTAAC ACTAGTTACT GOACCAOOTC TCACCCSTTOA 
33961 ACAAAACrcC CTTAGAACCA AACTTOCAGC AGCTATTGGT TATGATTCAT CAAACAACAT 
34.021 GGAAATEAAA ACGGGTOGTC GCATCOGTAT AAATAACAAC ITCTTAATTC TAGATGTOCA 
34081 TTACOCATTT GMGCTCAMi. CAAAACTACG TCTTAAACTO GGGCAG CGAC GCCTQTATAT 
3414i TAATCCATCr CATAACTTCG ACATAAACTA TAACAGAOGC CXATACOTTT TTAATCCATC 
34201 AAAGAATACT AAAAAACTGG AAGTTACCAT AAAAAAATCC ACTCCACTAA ACTTIGATAA 
34261 TACTGCCATA GCTATAAATC CAGGAAASCSG TCTGGAGTTT GATACAAACA CATCTGACTC 
34321 TCCAGATATC AACCCAATAA AAACTAAAAT TGGCTCTCGC ATTCATTACA ATGAAAAOGG 
34381 TOCGATSATr ACTAAACTTC GAGOGGCTTT AAGCTTTCXC AACTCAGGGG CCATTACAAT 
34441 AGGAAACAAA AATCATGACA AACTTACCCT GTGGACAACC CCAGACCCAT CTCCTAACTG 
34S01 CAGAATTCAT TCAGATAATC ACTOCAAArT TACTTTQGTT CTTACAAAAT GTCGGAGTCA 
34561 AGTACTAGCT ACTCTAGCT6 CTTTGGCTCT ATCTGGACAT cmCATCCA TGACAGGCAC 
34621 OGTIGCAAGT GTrAOTATAT TCCTTACATT TCACCAAAAC GGTCTTCTAA TCGAGAACIC 
34681 CTCACTTAAA AAACATTACT OGAACTTTAG AAATGGGAAC TCAACTAATG CAAATCCATA 
34741 CACAAATCCA GTrGCATTTA TGCCTAACCT TCTAGCCTAT CCAAAAACCC AAAGTCAAAC 
34801 TGCTAAAAAT AACATKSTCA GTCAAGTTTA CTTGCAT6GT CATAAAACTA AACCTATGAT 
34861 ACTTACCATT ACACTTAATG GCACTAGTCA ATCCACAGAA ACTAGCGAGG TAACCA CTTA 
34921 CTCTATCTCT TTTACATCGT CCTGGGAAAG TGOAAAATAC ACCACTCAAA CTTTTGCTAC 
34981 CAACTCTTAC ACCTTCTCCT ACATTGCCCA GGAATAAAGA ATCGTGAACC TGTTGCATCT 
35041 TATCITTCAA CGTCGGA1CC TTTATTATAG CGGAAGTCCA CGCCTACATC GGGGTAGACT 
35101 CATAATCGOG CATCAGGATA GOOCGGTGGT CCTGCAGCAG CGCGCGAATA AACTGCTGCC 
3S161 GCCGCOGCTC CGTCCIVSCAG GAATACAACA TCGCAGTGGT CTCCTCAGCO ATGATTCGCA 
35221 CCCCCCGCAG CATGAGACGC CTTOTCCTCC GGGCACAGCA GCGCACCCTG ATCTCACTTA 
■ 35281 AATCAGCACA GTAACTGCAG CACAGCACCA CAATATTCTT CAAAATCCCA CAGIGCAAGG 
35341 CGCTCTATCC AAAGCTCATC GCOGGGACCA CAGAACCCAC GTGGCCATCA TA CCACAAGC 
3S401 OCAGGTAGAT TAAGTCGCGA CCCCTCATAA ACACCCTGGA CATAAACATT ACCTCTrTTG 
35461 GCATCTTGTA ATTCACCACC TCCCGGTACC ATATAAACCT CTGATTAAAC ATGGCGCCAT 
35S21 CCACCACCAT CCTAAACCAG CTGGCCAAAA CCTGCCCGCC GGCTATCCAC TGCAGGGAAC 
35581 CGGGACTCGA ACAATCACAG TOGAGAGCCC AGGACTCGTA ACCATGGATC ATCATGCTCG 
35641 TCATCATATC AATCTTGGCA CAACACAOGC ACACGT<3qAT ACACTTCCTC AGGATTACAA 
35701 GCTCCTCCCG CGTCAGAACC ATATCCCAGG GAACAACCCA TTCCTGAATC AGCGTAAATC 
35761 CCACACIGCA GGGAAGACCl' CGCACGTAAC TCACGTTQTC CATTCTCAAA GTC53TACATT 
35821 CGGGCAGCAG CGGATCA1CC TCCAGTATGG TAGCGCGGGT CTCTOO^A AAAGGAGGTA 
35881 GGCGATCCCT ACTGTACGGA GTGCGCCGI.G ACAACCGAGA TCGTGTTGGT CGTAGTGTCA 
35941 TCCCAAATCG AACGCCGGAG GTAGTCATAT TTCATCGACA CGGCACCAGC TCAATCAGTC 
36001- ACAGTCTAAA AAGGGCCAAG TACAGAGCGA GTATATATAG GACTAAAAAA TGACGi^ACG 
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MudedtideSequam:e Analysis (eont) 

36061 <OTAAAGTCC ACAAAAAACA CCCASAAAAC CeCACeOflAA CCTACCCCCA GAAAC GAAAG 
36121 CCAAAAAACC CACAACTTCC TCAAATCTIC ACTTCOGTCT TCCCACGATA OeTCACTTCC 
36181 CATTTTAAAA AAACTACAAT TCCCAATACA •TOCAACSTTAC TCOOCCCTAA AACCTXCOTC 
36241 AixOGCCCCO WCCCACOCC CCCOCOCAOG TCACAAACTC CACCCCCTCA T»TCATATT 
•36301 GCSCITCAATC CAAAATAAGG TATATXATQA TSAIG 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANTS: Gregory, R.J./ Armentano, D., Couture, L.A., Smith, 
A,E. 

(ii) TITLE OP INVENTION: GENE THERAPY FOR CYSTIC FIBROSIS 



(iii) NUMBER OF SEQUENCES: 9 



(iv) CORRESPONDENCE ADDRESS: 

15 (A) ADDRESSEE: LAHIVE & COCKFIELD 

(B) STREET: 60 STATE STREET, SUITE 510 

(C) CITY: BOSTON 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: USA 
20 (F) ZIP: 02109 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

25 . (C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: ASCII 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
30 (B) FILING DATE: 02 -DEC- 1993 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/985,478 
35 (B) FILING DATE: 02-DEC-1992 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: Hanley, Elizabeth A. 

40 (B) REGISTRATION NUMBER: 33,505 

(C) REFERENCE/DOCKET NUMBER: NZI-014CP2PC 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: {617) 227-7400 
45 (B) TELEFAX: (617) 227-5941 



(2) INFORMATION FOR SEQ ID N0:1: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6129 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



55 



(ii) MOLECULE TYPE: cDNA 



97 



10 



20 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 133.. 4572 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 



AATTGGAAGC AAATGACATC ACAGCAGGTC A6AGAAAAAG GGTTGAGCGG CAGGCACCCA 60 
GAGTAGTAGG TCTTTGGCAT TAGGAGCTTG AGCCCAGACG GCCCTAGCAG GGACCCCAGC 120 



GCCCGAGAGA CC ATG CA6 AGG TCG OCT CTG GAA AAG GCC AGC GTT GTC 168 
Met Gin Arg Ser Pro Leu Glu Lys Ala Ser Val Val 
15 15 10 



TCC AAA CTT TTT TTC AGC TGG ACC AGA CCA ATT TTG AGG AAA GGA TAC 216 
Ser Lys Leu Phe Phe Ser Tip Thr Arg Pro lie Leu Arg Lys Gly Tyr 
15 20 25 

AGA CAG CGC CTG GAA TTG TCA GAC ATA TAC CAA ATC CCT TCT GTT GAT 264 
Arg Gin Arg Leu Glu Leu Ser Asp lie Tyr Gin lie Pro Ser Val Asp 
30 35 40 

25 TCT GCT GAC AAT CTA TCT GAA AAA TTG GAA AGA GAA TGG GAT AGA GAG 312 
Ser Ala Asp Asn Leu Ser Glu Lys Leu Glu Arg Glu Trp Asp Arg Glu 
45 50 55 60 

CTG GCT TCA AAG AAA AAT CCT AAA CTC ATT AAT GCC CTT CGG CGA TGT 360 
30 Leu Ala Ser Lys Lys Asn Pro Lys Leu lie Asn Ala Leu Arg Arg Cys 

65 70 75 

TTT TTC TGG AGA TTT ATG TTC TAT GGA ATC TTT TTA TAT TTA GGG GAA 408 
Phe Phe Trp Arg Phe Met Phe Tyr Gly lie Phe Leu Tyr Leu Gly Glu 
35 80 85 90 

GTC ACC AAA GCA GTA CAG CCT CTC TTA CTG GGA AGA ATC ATA GCT TCC 456 
Val Thr Lys Ala Val Gin Pro Leu Leu Leu Gly Arg lie lie Ala Ser 
95 100 105 

40 

TAT GAC CCG GAT AAC AAG GAG GAA CGC TCT ATC GCG ATT TAT CTA GGC 504 
Tyr Asp Pro Asp Asn Lys Glu Glu Arg Ser lie Ala lie Tyr Leu Gly 
110 115 120 

45 ATA GGC TTA TGC CTT CTC TTT ATT GTG AGG ACA CTG CTC CTA CAC CCA 552 
lie Gly Leu Cys Leu Leu Phe lie Val Arg Thr Leu Leu Leu His Pro 
125 130 135 140 

GCC ATT TTT GGC CTT CAT CAC ATT GGA ATG CAG ATG AGA ATA GCT ATG 600 
50 Ala He Phe Gly Leu His His He Gly Met Gin Met Arg He Ala Met 

145 150 155 

TTT AGT TTG ATT TAT AAG AAG ACT TTA AAG CTG TCA AGC CGT GTT CTA 648 
Phe Ser Leu He Tyr Lys Lys Thr Leu Lys Leu Ser Ser Arg Val Leu 
55 160 165 170 
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GAT AAA ATA AGT ATT GGA CAA CTT GTT ACT CTC CTT TCC AAC AAC CTG 696 

Asp Lys lie Ser lie Gly Gin Leu Val Ser Leu Leu Ser Asn Asn Leu 
175 180 185 

5 

AAC AAA TTT GAT GAA GGA CTT GCA TTG GCA CAT TTC GTG TGG ATC GCT 744 

Asn Lys Phe Asp Glu Gly Leu Ala Leu Ala His Phe Val Trp He Ala 
190 195 200 

10 CCT TTG CAA GTG GCA CTC CTC ATG GGG CTA ATC TGG GAG TTG TTA CAG 792 
Pro Leu Gin Val Ala Leu Leu Met Gly Leu He Trp Glu Leu Leu Gin 
205 210 215 220 

GCG TCT GCC TTC TGT GGA CTT GGT TTC CTG ATA GTC CTT GCC CTT TTT . 840 
15 Ala Ser Ala Phe Cys Gly Leu Gly Phe Leu He Val Leu Ala Leu Phe 

225 230 235 

CAG GCT GGG CTA GGG AGA ATG ATG ATG AAG TAC AGA GAT CAG AGA GCT 888 
Gin Ala Gly Leu Gly Arg Met Met Met Lys Tyr Arg Asp Gin Arg Ala 
20 240 245 250 

GGG AAG ATC AGT GAA AGA CTT GTG ATT ACC TCA GAA ATG ATT GAA AAT 936 
Gly Lys He Ser Glu Arg Leu Val He Thr Ser Glu Met He Glu Asn 
255 260 265 



25 



45 



ATC CAA TCT GTT AAG GCA TAC TGC TGG GAA GAA GCA ATG GAA AAA ATG 984 
He Gin Ser Val Lys Ala Tyr Cys Trp Glu Glu Ala Met Glu Lys Met 
270 275 280 



30 ATT GAA AAC TTA AGA CM, ACA GAA CTG AAA CTG ACT CGG AAG GCA GCC 1032 
He Glu Asn Leu Arg Gin Thr Glu Leu Lys Leu Thr Arg Lys Ala Ala 
285 290 295 300 

TAT GTG AGA TAC TTC AAT AGC TCA GCC TTC TTC TTC TCA GGG TTC TTT 1080 
35 Tyr Val Arg Tyr Phe Asn Ser Ser Ala Phe Phe Phe Ser Gly Phe Phe 

305 310 315 

GTG GTG TTT TTA TCT GTG CTT CCC TAT GCA CTA ATC AAA GGA ATC ATC 1128 
Val Val Phe Leu Ser Val Leu Pro Tyr Ala Leu He Lys Gly He He 
40 320 325 330 

CTC CGG AAA ATA TTC ACC ACC ATC TCA TTC TGC ATT GTT CTG CGC ATG 1176 
Leu Arg Lys He Phe Thr Thr He Ser Phe Cys He Val Leu Arg Met 
335 340 345 



GCG GTC ACT CGG CAA TTT CCC TGG GCT GTA CAA ACA TGG TAT GAC TCT 1224 
Ala Val Thr Arg Gin Phe Pro Trjp Ala Val Gin Thr Trp Tyr Asp Ser 
350 355 360 



50 CTT GGA GCA ATA AAC AAA ATA CAG GAT TTC TTA CAA AAG CAA GAA TAT 1272 
Leu Gly Ala He Asn Lys He Gin Asp Phe Leu Gin Lys Gin Glu Tyr 
365 370 375 380 

AAG ACA TTG GAA TAT AAC TTA ACG ACT ACA GAA GTA GTG ATG GAG AAT 1320 
55 Lys Thr Leu Glu Tyr Asn Leu Thr Thr Thr Glu Val Val Met Glu Asn 

385 390 395 
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GTA ACA GCC TTC TGG GAG GAG GGA TTT GGG GAA TTA TTT GAG AAA GCA 1368 
Val Thr Ala Phe Trp Glu Glu Gly Phe Gly Glu Leu Phe Glu Lys Ala 
400 405 410 

5 

AAA CAA AAC AAT AAC AAT AGA AAA ACT TCT AAT GGT GAT GAC AGC CTC 1416 
Lys Gin Asn Asn Asn Asn Arg Lys Thr Ser Asn Gly Asp Asp Ser Leu 
415 420 425 

10 TTC TTC AGT AAT TTC TCA CTT CTT GGT ACT CCT GTC CTG AAA GAT ATT 1464 
Phe Phe Ser Asn Phe Ser Leu Leu Gly Thr Pro Val Leu Lys Asp lie 
430 435 440 

AAT TTC AAG ATA GAA AGA GGA CAG TTG TTG GCG GTT GCT GGA TCC ACT 1512 
15 Asn Phe Lys lie Glu Arg Gly Gin Leu Leu Ala Val Ala Gly Ser Thr 
445 450 455 460 

GGA GCA GGC AAG ACT TCA CTT CTA ATG ATG ATT ATG GGA GAA CTG GAG 1560 
Gly Ala Gly Lys Thr Ser Leu Leu Met Met He Met Gly Glu Leu Glu 
20 465 470 475 

CCT TCA GAG GGT AAA ATT AAG CAC AGT GGA AGA ATT TCA TTC TGT TCT 1608 

Pro Ser Glu Gly Lys He Lys His Ser Gly Arg He Ser Phe Cys Ser 
480 485 490 

25 

CAG TTT TCC TGG ATT ATG CCT GGC ACC ATT AAA GAA AAT ATC ATC TTT 1656 

Gin Phe Ser Trp He Met Pro Gly Thr He Lys Glu Asn He He Phe 
495 500 505 

30 GGT GTT TCC TAT GAT GAA TAT AGA TAC AGA AGC GTC ATC AAA GCA TGC 1704 
Gly Val Ser Tyr Asp Glu Tyr Arg Tyr Arg Ser Val He Lys Ala Cys 
510 515 520 

CAA CTA GAA GAG GAC ATC TCC AAG TTT GCA GAG AAA GAC AAT ATA GTT 1752 
35 Gin Leu Glu Glu Asp He Ser Lys Phe Ala Glu Lys Asp Asn He Val 
525 530 535 540 

CTT GGA GAA GGT GGA ATC ACA CTG AGT GGA GGT CAA CGA GCA AGA ATT 1800 
Leu Gly Glu Gly Gly He Thr Leu Ser Gly Gly Gin Arg Ala Arg He 
40 545 550 555 

TCT TTA GCA AGA GCA GTA TAC AAA GAT GCT GAT TTG TAT TTA TTA GAC 1848 
Ser Leu Ala Arg Ala Val Tyr Lys Asp Ala Asp Leu Tyr Leu Leu Asp 
560 565 570 

45 

TCT CCT TTT GGA TAC CTA GAT GTT TTA ACA GAA AAA GAA ATA TTT GAA 1896 
Ser Pro Phe Gly Tyr Leu Asp Val Leu Thr Glu Lys Glu He Phe Glu 
575 580 585 

50 AGC TGT GTC TGT AAA CTG ATG GCT AAC AAA ACT AGG ATT TTG GTC ACT 1944 
Ser Cys Val Cys Lys Leu Met Ala Asn Lys Thr Arg He Leu Val Thr 
590 595 600 

TCT AAA ATG GAA CAT TTA AAG AAA GCT GAC AAA ATA TTA ATT TTG CAT 1992 
55 Ser Lys Met Glu His Leu Lys Lys Ala Asp Lys He Lett He Leu His 
605 610 615 620 
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GAA GGT AGC AGC TAT TTT TAT GGG ACA 
Glu Gly Ser Ser Tyr Phe Tyr Gly Thr 
625 

5 

CAG CCA GAC TTT AGC TCA AAA CTC ATG 
Gin Pro Asp Phe Ser Ser Lys Leu Met 
640 645 



TTT TCA GAA CTC CAA AAT CTA 2040 
Phe Ser Glu Leu Gin Asn Leu 
630 635 

GGA TGT GAT TCT TTC GAC CAA 2088 
Gly Cys Asp Ser Phe Asp Gin 
650 



10 TTT AGT GCA GAA AGA AGA AAT TCA ATC CTA ACT GAG ACC TTA CAC CGT 2136 
Phe Ser Ala Glu Arg Arg Asn Ser lie Leu Thr Glu Thr Leu His Arg 
655 660 665 



TTC TCA TTA GAA GGA GAT GCT CCT GTC TCC TGG ACA GAA ACA AAA AAA 2184 
15 Phe Ser Leu Glu Gly Asp Ala Pro Val Ser Trp Thr Glu Thr Lys Lys 
670 675 680 

CAA TCT TTT AAA CAG ACT GGA GAG TTT GGG GAA AAA AGG AAG AAT TCT 2232 
Gin Ser Phe Lys Gin Thr Gly Glu Phe Gly Glu Lys Arg Lys Asn Ser 
20 685 690 695 700 



ATT CTC AAT CCA ATC AAC TCT ATA CGA AAA TTT TCC ATT GTG CAA AAG 2280 
lie Leu Asn Pro lie Asn Ser lie Arg Lys Phe Ser lie Val Gin Lys 
705 710 715 

25 

ACT CCC TTA CAA ATG AAT GGC ATC GAA GAG GAT TCT GAT GAG CCT TTA 2328 
Thr Pro Leu Gin Met Asn Gly lie Glu Glu Asp Ser Asp Glu Pro Leu 
720 725 730 

30 GAG AGA AGG CTG TCC TTA GTA CCA GAT TCT GAG CAG GGA GAG GCG ATA 2376 
Glu Arg Arg Leu Ser Leu Val Pro Asp Ser Glu Gin Gly Glu Ala lie 
735 740 745 



CTG CCT CGC ATC AGC GTG ATC AGC ACT GGC CCC ACG CTT CAG GCA CGA 2424 
35 Leu Pro Arg He Ser Val He Ser Thr Gly Pro Thr Leu Gin Ala Arg 
750 755 760 



AGG AGG CAG TCT GTC CTG AAC CTG ATG ACA CAC TCA GTT AAC CAA GGT 2472 
Arg Arg Gin Ser Val Leu Asn Leu Met Thr His Ser Val Asn Gin Gly 
40 765 770 775 780 

CAG AAC ATT CAC CGA AAG ACA ACA GCA TCC ACA CGA AAA GTG TCA CTG 2520 
Gin Asn He His Arg Lys Thr Thr Ala Ser Thr Arg Lys Val Ser Leu 
785 790 795 



45 



GCC CCT CAG GCA AAC TTG ACT GAA CTG GAT ATA TAT TCA AGA AGG TTA 2568 
Ala Pro Gin Ala Asn Leu Thr Glu Leu Asp He Tyr Ser Arg Arg Leu 
800 805 810 



50 TCT CAA GAA ACT GGC TTG GAA ATA AGT GAA GAA ATT AAC GAA GAA GAC 2616 
Ser Gin Glu Thr Gly Leu Glu He Ser Glu Glu He Asn Glu Glu Asp 
815 820 825 

TTA AAG GAG TGC CTT TTT GAT GAT ATG GAG AGC ATA CCA GCA GTG ACT 2664 
55 Leu Lys Glu Cys Leu Phe Asp Asp Met Glu Ser He Pro Ala Val Thr 
830 835 840 
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ACA TGG AAC ACA TAG CTT CGA TAT ATT ACT GTC CAC AAG AGC TTA ATT 2712 
Thr Trp T^n Thr Tyr Leu Arg Tyr He Thr Val His Lys Ser Leu He 
845 850 855 860 

5 

TTT GTG CTA ATT TGG TGC TTA GTA ATT TTT CTG GCA GAG GTG GCT GCT 2760 
Phe Val Leu He Trp Cys Leu Val He Phe' Leu Ala Glu Val Ala Ala 
865 870 875 

10 TCT TTG GTT GTG CTG TGG CTC CTT GGA AAC ACT CCT CTT CAA GAC AAA 2808 
Ser Leu Val Val Leu Trp Leu Leu Gly Asn Thr Pro Leu Gin Asp Lys 
880 885 890 

GGG AAT AGT ACT CAT AGT AGA AAT AAC AGC TAT GCA GTG ATT ATC ACC 2856 
15 Gly Asn Ser Thr His Ser Arg Asn Asn Ser Tyr Ala Val He He Thr 
895 900 905 

AGC ACC AGT TCG TAT TAT GTG TTT TAG ATT TAC GTG GGA GTA GCC GAC 2904 
Ser Thr Ser Ser Tyr Tyr Val Phe Tyr He Tyr Val Gly Val Ala Asp 
20 910 915 920 

ACT TTG CTT GCT ATG GGA TTC TTC AGA GGT CTA CCA CTG GTG CAT ACT 2952 
Thr Leu Leu Ala Met Gly Phe Phe Arg Gly Leu Pro Leu Val His Thr 
925 930 935 940 

25 

CTA ATC ACA GTG TCG AAA ATT TTA CAC CAC AAA ATG TTA CAT TCT GTT 3000 
Leu He Thr Val Ser Lys He Leu His His Lys Met Leu His Ser Val 
945 950 955 

30 CTT CAA GCA CCT ATG TCA ACC CTC AAC ACG TTG AAA GCA GGT GGG ATT 3048 
Leu Gin Ala Pro Met Ser Thr Leu Asn Thr Leu Lys Ala Gly Gly He 
960 965 970 

CTT AAT AGA TTC TCC AAA GAT ATA GCA ATT TTG GAT GAC CTT CTG CCT 3096 
35 Leu Asn Arg Phe Ser Lys Asp He Ala He Leu Asp Asp Leu Leu Pro 
975 980 985 

CTT ACC ATA TTT GAC TTC ATC CAG TTG TTA TTA ATT GTG ATT GGA GCT 3144 
Leu Thr He Phe Asp Phe He Gin Leu Leu Leu He Val He Gly Ala 
40 990 995 1000 

ATA GCA GTT GTC GCA GTT TTA CAA CCC TAC ATC TTT GTT GCA ACA GTG 3192 

He Ala Val Val Ala Val Leu Gin Pro Tyr He Phe Val Ala Thr Val 
1005 1010 1015 1020 

45 

CCA GTG ATA GTG GCT TTT ATT ATG TTG AGA GCA TAT TTC CTC CAA ACC 3240 

Pro Val He Val Ala Phe He Met Leu Arg Ala Tyr Phe Leu Gin Thr 

1025. 1030 1035 

50 TCA CAG CAA CTC AAA CAA CTG GAA TCT GAA GGC AGG AGT CCA ATT TTC 3288 
Ser Gin Gin Leu Lys Gin Leu Glu Ser Glu Gly Arg Ser Pro He Phe 
1040 1045 1050 



ACT CAT CTT GTT ACA AGC TTA AAA GGA CTA TGG ACA CTT CGT GCC TTC 3336 
55 Thr His Leu Val Thr Ser Leu Lys Gly Leu Trp Thr Leu Arg Ala Phe 
1055 1060 1065 
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GGA CGG CAG CCT TAG TTT GAA ACT CTG TTC CAC AAA GCT CTG AAT TTA 3384 
Gly Arg Gin Pro Tyr Phe Glu Thr Leu Phe His Lys Ala Leu Asn Leu 
1070 1075 1080 

5 

CAT ACT GCC AAC TGG TTC TTG TAC CTG TCA ACA CTG CGC TGG TTC CAA' 3432 
His Thr Ala Asn Trp Phe Leu Tyr Leu Ser Thr Leu Arg Trp Phe Gin 
1085 1090 1095 1100 

10 ATG AGA ATA GAA ATG ATT TTT GTC ATC TTC TTC ATT GCT GTT ACC TTC 3480 
Met Arg He Glu Met He Phe Val He Phe Phe He Ala Val Thr Phe 
1105 1110 1115 

ATT TCC ATT TTA ACA ACA GGA GAA GGA GAA GGA AGA GTT GGT ATT ATC 3528 
15 He Ser He Leu Thr Thr Gly Glu Gly Glu Gly Arg Val Gly He He 
1120 1125 1130 

CTG ACT TTA GCC ATG AAT ATC ATG AGT ACA TTG CAG TGG GCT GTA AAC 3576 
Leu Thr Leu Ala Met Asn He Met Ser Thr Leu Gin Trp Ala Val Asn 
20 1135 1140 1145 

• TCC AGC ATA GAT GTG GAT AGC TTG ATG CGA TCT GTG AGC CGA GTC TTT 3624 
Ser Ser He Asp Val Asp Ser Leu Met Arg Ser Val Ser Arg Val Phe 
1150 1155 1160 

25 

AAG TTC ATT GAC ATG CCA ACA GAA GGT AAA CCT ACC AAG TCA ACC AAA 3672 
Lys Phe He Asp Met Pro Thr Glu Gly Lys Pro Thr Lys Ser Thr Lys 
1165 1170 1175 1180 

30 CCA TAC AAG AAT GGC CAA CTC TCG AAA GTT ATG ATT ATT GAG AAT TCA 3720 
Pro Tyr Lys Asn Gly Gin Leu Ser Lys Val Met He He Glu Asn Ser 
1185 1190 1195 

CAC GTG AAG AAA GAT GAC ATC TGG CCC TCA GGG GGC CAA ATG ACT GTC 3768 
35 His Val Lys Lys Asp Asp He Trp Pro Ser Gly Gly Gin Met Thr Val 
1200 1205 1210 

AAA GAT CTC ACA GCA AAA TAC ACA GAA GGT GGA AAT GCC ATA TTA GAG 3816 
Lys Asp Leu Thr Ala Lys Tyr Thr Glu Gly Gly Asn Ala He Leu Glu 
40 1215 1220 1225 

AAC ATT TCC TTC TCA ATA AGT CCT GGC CAG AGG GTG GGC CTC TTG GGA 3864 
Asn He Ser Phe Ser He Ser Pro Gly Gin Arg Val Gly Leu Leu Gly 
1230 1235 1240 

45 

AGA ACT GGA TCA GGG AAG AGT ACT TTG TTA TCA GCT TTT TTG AGA CTA 3912- 
Arg Thr Gly Ser Gly Lys Ser Thr Leu Leu Ser Ala Phe Leu Arg Leu 
1245 1250 1255 1260 

50 CTG AAC ACT GAA GGA GAA ATC CAG ATC GAT GGT GTG TCT TGG GAT TCA 3960 
Leu Asn Thr Glu Gly Glu He Gin He Asp Gly Val Ser Trp Asp Ser 
1265 1270 1275 

ATA ACT TTG CAA CAG TGG AGG AAA GCC TTT GGA GTG ATA CCA CAG AAA 4008 
55 He Thr Leu Gin Gin Trp Arg Lys Ala Phe Gly Val He Pro Gin Lys 
1280 1285 1290 
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GTA TTT ATT TTT TCT GGA ACA TTT AGA AAA AAC TTG GAT CCC TAT GAA 4056 
Val Phe lie Phe Ser Gly Thr Phe Arg Lys Asn Leu Asp Pro Tyr Glu 
1295 * 1300 1305 

5 

CAG TGG AGT GAT CAA GAA ATA TGG AAA GTT GCA GAT GAG GTT GGG CTC 4104 
Gin Trp Ser Asp Gin Glu He Trp Lys Val Ala Asp Glu Val Gly Leu 
1310 1315 1320 

10 AGA TCT GTG ATA GAA CAG TTT CCT GGG AAG CTT GAC TTT GTC CTT GTG 4152 
Arg Ser Val He Glu Gin Phe Pro Gly Lys Leu Asp Phe Val Leu Val 
1325 1330 1335 1340 

GAT GGG GGC TGT GTC CTA AGC CAT GGC CAC AAG CAG TTG ATG TGC TTG 4200 
15 Asp Gly Gly Cys Val Leu Ser His Gly His Lys Gin Leu Met Cys Leu 

1345 1350 1355 

GCT AGA TCT GTT CTC AGT AAG GCG AAG ATC TTG CTG CTT GAT GAA CCC 4248 
Ala Arg Ser Val Leu Ser Lys Ala Lys He Leu Leu Leu Asp Glu Pro 
20 1360 1365 1370 

AGT GCT CAT TTG GAT CCA GTA ACA TAC CAA ATA ATT AGA AGA ACT CTA 4296 
Ser Ala His Leu Asp Pro Val Thr Tyr Gin He He Arg Arg Thr Leu 
1375 1380 1385 

25 

AAA CAA GCA TTT GCT GAT TGC ACA GTA ATT CTC TGT GAA CAC AGG ATA 4344 
Lys Gin Ala Phe Ala Asp Cys Thr Val He Leu Cys Glu His Arg He 
1390 1395 1400 

30 GAA GCA ATG CTG GAA TGC CAA CAA TTT TTG GTC ATA GAA GAG AAC AAA 4392 
Glu Ala Met Leu Glu Cys Gin Gin Phe Leu Val He Glu Glu Asn Lys 
1405 1410 1415 1420 

GTG CGG CAG TAC GAT TCC ATC CAG AAA CTG CTG AAC GAG AGG AGC CTC 4440 
35 Val Arg Gin Tyr Asp Ser He Gin Lys Leu Leu Asn Glu Arg Ser Leu 

1425 1430 1435 

TTC CGG CAA GCC ATC AGC CCC TCC GAC AGG GTG AAG CTC TTT CCC CAC 4466 
Phe Arg Gin Ala He Ser Pro Ser Asp Arg Val- Lys Leu Phe Pro His 
40 1440 1445 1450 

CGG AAC TCA AGC AAG TGC AAG TCT AAG CCC CAG ATT GCT GCT CTG AAA 4536 

Arg Asn Ser Ser Lys Cys Lys Ser Lys Pro Gin He Ala Ala Leu Lys 

1455 1460 1465 

45 

GAG GAG ACA GAA GAA GAG GTG CAA GAT ACA AGG CTT TAGAGAGCAG 4582 

Glu Glu Thr Glu Glu Glu Val Gin Asp Thr Arg Leu 

1470 1475 1460 ^ 

50 CATAAATGTT GACATGGGAC ATTTGCTCAT GGAATTGGAG CTCGTGGGAC AGTCACCTCA 4642 

TGGAATTGGA GCTCGTGGAA CAGTTACCTC TGCCTCAGAA AACAAGGATG AATTAAGTTT 4702 

TTTTTTAAAA AAGAAACATT TGGTAAGGGG AATTGAGGAC ACTGATATGG GTCTTGATAA 4762 

55 

ATGGCTTCCT GGCAATAGTC AAATTGTGTG A7UVGGTACTT CAAATCCTTG AAGATTTACC 4822 

ACTTGTGTTT TGCAAGCCAG ATTTTCCTGA AAACCCTTGC CATGTGCTAG TAATTGGAAA 4882 
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GGCAGCTCTA AATGTCAATC AGCCTAGTTG ATCAGCTTAT TGTCTAGTGA AACTCGTTAA 4942 

TTTGTAGTGT TGGAGAAGAA CTGAAATCAT ACTTCTTAGG GTTATGATTA AGTAATGATA 5002 

5 ACTGGAAACT TCAGCGGTTT ATATAAGCTT GTATTCCTTT TTCTCTCCTC TCCCCATGAT 5062 

GTTTAGAAAC ACAACTATAT TGTTTGCTAA GCATTCCAAC TATCTCATTT CCAAGCAAGT 5122 

ATTAGAATAC CACAGGAACC ACAAGACTGC ACATCAAAAT ATGCCCCATT CAACATCTAG 5182 

10 

TGAGCAGTCA GGAAAGAGAA CTTCCAGATC CTGGAAATCA GGGTTAGTAT TGTCCAGGTC 5242 

TACCAAAAAT CTCAATATTT CAGATAATCA CAATACATCC CTTACCTGGG AAAGGGCTGT 5302 

15 TATAATCTTT CACAGGGGAC AGGATGGTTC CCTTGATGAA GAAGTTGATA TGCCTTTTCC 5362 

CAACTCCAGA AAGTGACAAG CTCACAGACC TTTGAACTAG AGTTTAGCTG GAAAAGTATG 5422 

TTAGTGCAAA TTGTCACAGG ACAGCCCTTC TTTCCACAGA AGCTCCAGGT AGAGGGTGTG 5482 

20 

TAAGTAGATA GGCCATGGGC ACTGTGGGTA GACACACATG AAGTCCAAGC ATTTAGATGT 5542 

ATAGGTTGAT GGTGGTATGT TTTCAGGCTA GATGTATGTA CTTCATGCTG TCTACACTAA 5602 

25 GAGAGAATGA GAGACACACT GAAG7AGCAC CAATCATGAA TTAGTTTTAT ATGCTTCTGT 5662 

TTTATAATTT TQTGAAGCAA AATTTTTTCT CTAGGAAATA TTTATTTTAA TAATGTTTCA 5722 

AACATATATT ACAATGCTGT ATTTTAAAAG AATGATTATG AATTACATTT GTATAAAATA 5782 

30 

ATTTTTATAT TTGAAATATT GACTTTTTAT GGCACTAGTA TTTTTATGAA ATATTATGTT 5842 

AAAACTGGGA CAGGGGAGAA CCTAGGGTGA TATTAACCAG GGGCCATGAA TCACCTTTTG 5902 
35 GTCTGGAGGG AAGCCTTGGG GCTGATCGAG TTGTTGCCCA CAGCTGTATG ATTCCCAGCC * 5962 

AGACACAGCC TCTTAGATGC AGTTCTGAAG AAGATGGTAC CACCAGTCTG ACTGTTTCCA 6022 

TCAAGGGTAC ACTGCCTTCT CAACTCCAAA CTGACTCTTA AGAAGACTGC ATTATATTTA 6082 

40 

TTACTGTAAG AAAATATCAC TTGTCAATAA AATCCATACA TTTGTGT 6129 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1480 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



55 Met Gin Arg Ser Pro Leu Glu Lys Ala Ser Val Val Ser Lys Leu Phe 
1 5 10 15 
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Phe Ser Trp Thr Arg Pro He Leu Arg Lys Gly Tyr Arg Gin Arg Leu 
20 25 30 

5 Glu Leu Ser Asp He Tyr Gin He Pro Ser Val Asp Ser Ala Asp Asn 
35 40 45 

Leu Ser Glu Lys Leu Glu Arg Glu Trp Asp Arg Glu Leu Ala Ser Lys 
50 55 60 

10 

Lys Asn Pro Lys Leu He Asn Ala Leu Arg Arg Cys Phe Phe Trp Arg 
65 70 75 80 

Phe Met Phe Tyr Gly He Phe Leu Tyr Leu Gly Glu Val Thr Lys Ala 
15 85 90 95 

Val Gin Pro Leu Leu Leu Gly Arg He He Ala Ser Tyr Asp Pro Asp 
100 105 110 

20 Asn Lys Glu Glu Arg Ser He Ala He Tyr Leu Gly He Gly Leu Cys 
115 120 125 

Leu Leu Phe He Val Arg Thr Leu Leu Leu His Pro Ala He Phe Gly 
130 135 140 

25 

Leu His His He Gly Met Gin Met Arg He Ala Met Phe Ser Leu He 
145 150 155 160 

Tyr Lys Lys Thr Leu Lys Leu Ser Ser Arg Val Leu Asp Lys He Ser 
30 165 170 175 

He Gly Gin Leu Val Ser Leu Leu Ser Asn Asn Leu Asn Lys Phe Asp 
180 185 190 

35 Glu Gly Leu Ala Leu Ala His Phe Val Trp He Ala Pro Leu Gin Val 
195 200 205 

Ala Leu Leu Met Gly Leu He Trp Glu Leu Leu Gin Ala Ser Ala Phe 
210 215 220 

40 

Cys Gly Leu Gly Phe Leu He Val Leu Ala Leu Phe Gin Ala Gly Leu 
225 230 235 240 

Gly Arg Met Met Met Lys Tyr Arg Asp Gin Arg Ala Gly Lys He Ser 
45 245 250 255 

Glu Arg Leu Val He Thr Ser Glu Met He Glu Asn He Gin Ser Val 
260 265 270 

50 Lys Ala Tyr Cys Trp Glu Glu Ala Met Glu Lys Met He Glu Asn Leu 
275 280 285 

Arg Gin Thr Glu Leu Lys Leu Thr Arg Lys Ala Ala Tyr Val Arg Tyr 
.290 295 300 

55 

Phe Asn Ser Ser Ala Phe Phe Phe Ser Gly Phe Phe Val Val Phe Leu 
305 310 315 320 
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Ser Val Leu Pro Tyr Ala Leu He Lys Gly He He Leu Arg Lys He 
325 330 335 

Phe Thr Thr He Ser Phe Cys He Val Leu Arg Met Ala Val Thr Arg 
340 345 350 

Gin Phe Pro Trp Ala Val Gin Thr Trp Tyr Asp Ser Leu Gly Ala He 
355 360 365 

Asn Lys He Gin Asp Phe Leu Gin Lys Gin Glu Tyr Lys Thr Leu Glu 
370 375 380 



Tyr Asn Leu Thr Thr Thr Glu Val Val Met Glu Asn Val Thr Ala Phe 
15 385 390 395 400 

Trp Glu Glu Gly Phe Gly Glu Leu Phe Glu Lys Ala Lys Gin Asn Asn 
405 410 415 

20 Asn Asn Arg Lys Thr Ser Asn Gly Asp Asp Ser Leu Phe Phe Ser Asn 
420 425 430 



25 



Phe Ser Leu Leu Gly Thr Pro Val Leu Lys Asp He Asn Phe Lys He 
435 440 445 

Glu Arg Gly Gin Leu Leu Ala Val Ala Gly Ser Thr Gly Ala Gly Lys 
450 455 460 



Thr Ser Leu Leu Met Met He Met Gly Glu Leu Glu Pro Ser Glu Gly 
30 465 470 475 480 

Lys He Lys His Ser Gly Arg He Ser Phe Cys Ser Gin Phe Ser Trp 
485 490 495 

35 He Met Pro Gly Thr He Lys Glu Asn He He Phe Gly Val Ser Tyr 
500 505 510 



40 



Asp Glu Tyr Arg Tyr Arg Ser Val He Lys Ala Cys Gin Leu Glu Glu 
515 520 525 

Asp He Ser Lys Phe Ala Glu Lys Asp Asn He Val Leu Gly Glu Gly 
530 535 540 



Gly He Thr Leu Ser Gly Gly Gin Arg Ala Arg He Ser Leu Ala Arg 

45 545 550 555 560 

Ala Val Tyr Lys Asp Ala Asp Leu Tyr Leu Leu Asp Ser Pro Phe Gly 

565 570 575 

50 Tyr Leu Asp Val Leu Thr Glu Lys Glu He Phe Glu Ser Cys Val Cys 

580 585 590 



55 



Lys Leu Met Ala Asn Lys Thr Arg He Leu Val Thr Ser Lys Met Glu 
595 600 605 

His Leu Lys Lys Ala Asp Lys He Leu He Leu His Glu Gly Ser Ser 

610 615 620 
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Tyr Phe Tyr Gly Thr Phe Ser Glu Leu Gin Asn Leu Gin Pro Asp Phe 
625 630 635 640 

5 Ser Ser Lys Leu Met Gly Cys Asp Ser Phe Asp Gin Phe Ser Ala Glu 

645 650 655 

Arg Arg Asn Ser lie Leu Thr Glu Thr Leu His Arg Phe Ser Leu Glu 
660 665 670 

10 

Gly Asp Ala Pro Val Ser Trp Thr Glu Thr Lys Lys Gin Ser Phe Lys 
675 680 685 

Gin Thr Gly Glu Phe Gly Glu Lys Arg Lys Asn Ser lie Leu Asn Pro 
15 690 695 700 

lie Asn Ser lie Arg Lys Phe Ser He Val Gin Lys Thr Pro Leu Gin 
705 710 715 720 

20 Met Asn Gly He Glu Glu Asp Ser Asp Glu Pro Leu Glu Arg Arg Leu 

725 730 735 

Ser Leu Val Pro Asp Ser Glu Gin Gly Glu Ala He Leu Pro Arg He 
740 745 750 

25 

Ser Val He Ser Thr Gly Pro Thr Leu Gin Ala Arg Arg Arg Gin Ser 
755 760 765 

Val Leu Asn Leu Met Thr His Ser Val Asn Gin Gly Gin Asn He His 
30 770 775 780 

Arg Lys Thr Thr Ala Ser Thr Arg Lys Val Ser Leu Ala Pro Gin Ala 
785 790 795 800 

35 Asn Leu Thr Glu Leu Asp He Tyr Ser Arg Arg Leu Ser Gin Glu Thr 

805 810 815 

Gly Leu Glu He Ser Glu Glu He Asn Glu Glu Asp Leu Lys Glu Cys 
820 825 830 

40 

Leu Phe Asp Asp Met Glu Ser He Pro Ala Val Thr Thr Trp Asn Thr 
835 840 845 

Tyr Leu Arg Tyr He Thr Val His Lys Ser Leu He Phe Val Leu He 
45 850 855 860 

Trp Cys Leu Val He Phe Leu Ala Glu Val Ala Ala Ser Leu Val Val 
865 870 875 880 

50 Leu Trp Leu Leu Gly Asn Thr Pro Leu Gin Asp Lys Gly Asn Ser Thr 

885 890 895 

His Ser Arg Asn Asn Ser Tyr Ala Val He He Thr Ser Thr Ser Ser 
900 905 910 

55 

Tyr Tyr Val Phe Tyr He Tyr Val Gly Val Ala Asp Thr Leu Leu Ala 
915 920 925 
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Met aiy Phe Phe Arg Gly Leu Pro Leu Val His Thr Leu lie Thr Val 
930 935 940 

5 Ser Lys lie Leu His His Lys Met Leu His Ser Val Leu Gin Ala Pro 
945 950 955 960 



10 



25 



40 



55 



Met Ser Thr Leu Asn Thr Leu Lys Ala Gly Gly He Leu Asn Arg Phe 
965 970 975 

Ser Lys Asp He Ala He Leu Asp Asp Leu Leu Pro Leu Thr He Phe 
980 985 990 



Asp Phe He Gin Leu Leu Leu He Val He Gly Ala He Ala Val Val 
15 995 1000 1005 

Ala Val Leu Gin Pro Tyr He Phe Val Ala Thr Val Pro Val He Val 
1010 1015 1020 

20 Ala Phe He Met Leu Arg Ala Tyr Phe Leu Gin Thr Ser Gin Gin Leu 
1025 1030 1035 1040 



Lys Gin Leu Glu Ser Glu Gly Arg Ser Pro He Phe Thr His Leu Val 
1045 1050 1055 

Thr Ser Leu Lys Gly Leu Trp Thr Leu Arg Ala Phe Gly Arg Gin Pro 
1060 1065 1070 



Tyr Phe Glu Thr Leu Phe His Lys Ala Leu Asn Leu His Thr Ala Asn 
30 1075 1080 1085 

Trp Phe Leu Tyr Leu Ser Thr Leu Arg Trp Phe Gin Met Arg He Glu 
1090 1095 1100 

35 Met He Phe Val He Phe Phe He Ala Val Thr Phe He Ser He Leu 
1105 1110 1115 H20 



Thr Thr Gly Glu Gly Glu Qly Arg Val Gly He He Leu Thr Leu Ala 
1125 1130 1135 

Met Asn He Met Ser Thr Leu Gin Trp Ala Val Asn Ser Ser He Asp 
1140 1145 1150 



Val Asp Ser Leu Met Arg Ser Val Ser Arg Val Phe Lys Phe He Asp 
45 1155 1160 1165 

Met Pro Thr Glu Gly Lys Pro Thr Lys Ser Thr Lys Pro Tyr Lys Asn 
1170 1175 1180 

50 Gly Gin Leu Ser Lys Val Met He He Glu Asn Ser His Val Lys Lys 
1185 1190 1195 1200 



Asp Asp He Trp Pro Ser Gly Gly Gin Met Thr Val Lys Asp Leu Thr 
1205 1210 1215 

Ala Lys Tyr Thr Glu Gly Gly Asn Ala He Leu Glu Asn He Ser Phe 
1220 1225 1230 
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Ser lie Ser Pro Gly Gin Arg Val Gly Leu Leu Gly Arg Thr Gly Ser 
1235 1240 1245 

5 Gly Lys Ser Thr Leu Leu Ser Ala Phe Leu Arg Leu Leu Asn Thr Glu 
1250 1255 1260 

Gly Glu lie Gin lie Asp Gly Val Ser Trp Asp Ser lie Thr Leu Gin 
1265 1270 1275 1280 

10 

Gin Trp Arg Lys Ala Phe Gly Val lie Pro Gin Lys Val Phe lie Phe 
1285 1290 1295 

Ser Gly Thr Pfee Arg Lys Asn Leu Asp Pro Tyr Glu Gin Trp Ser Asp 
15 1300 1305 1310 

Gin Glu lie Trp Lys Val Ala Asp Glu Val Gly Leu Arg Ser Val lie 
1315 1320 1325 

20 Glu Gin Phe Pro Gly Lys Leu Asp Phe Val Leu Val Asp Gly Gly Cys 
1330 1335 1340 

Val Leu Ser His Gly His Lys Gin Leu Met Cys Leu Ala Arg Ser Val 
. 1345 1350 1355 1360 

25 

Leu Ser Lys Ala Lys lie Leu Leu Leu Asp Glu Pro Ser Ala His Leu 
1365 1370 1375 

' Asp Pro Val Thr Tyr Gin lie He Arg Arg Thr Leu Lys Gin Ala Phe 
30 1380 1385 1390 

Ala Asp Cys Thr Val He Leu Cys Glu His Arg He Glu Ala Met Leu 
1395 1400 1405 

35 Glu Cys Gin Gin Phe Leu Val He Glu Glu Asn Lys Val Arg Gin Tyr 
1410 1415 1420 

Asp Ser He Gin Lys Leu Leu Asn Glu Arg Ser Leu Phe Arg Gin Ala 
1425 1430 1435 1440 

40 

He Ser Pro Ser Asp Arg Val Lys Leu Phe Pro His Arg Asn Ser Ser 
1445 1450 1455 

Lys Cys Lys Ser Lys Pro Gin He Ala Ala Leu Lys Glu Glu Thr Glu 
45 1460 1465 1470 

Glu Glu Val Gin Asp Thr Arg Leu 
1475 1480 

50 (2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5635 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG GGGGTGGAGT 60 

5 

TTGTGACGTG GCGCGGGGCG TGGGAAC6GG GCGGGTGACG TAGTA6TGTG GCGGAAGTGT 120 

GATGTTGCAA GTGTGGCGGA ACACATGTAA GCGCCGGATG TGGTAAAAGT GACGTTTTTG 180 

10 GTGTGCGCCG GTGTATACGG GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG 240 

TA7ATTTGGG CGTAACCAAG TAATGTTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 300 

AGTGAAATCT GAATAATTCT GTGTTACTCA TAGCGCGTAA TATTTGTCTA GGGCCGCGGG 360 

15 

GACTTTGACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT CTCAGGTGTT TTCCGCGTTC 420 

CGGGTCAAAG TTGGCGTTTT ATTATTATAG TCAGCTGACG CGCAGTGTAT TTATACCCGG 480 

20 TGAGTTCCTC AAGAGGCCAC TCTTGAGTGC CAGCGAGTAG AGTTTTCTCC TCCGAGCCGC 540 

TCCGAGCTAG TAACGGCCGC CAGTGTGCTG CAGATATCAA AGTCGACGGT ACCCGAGAGA 600 

CCATGCAGAG GTCGCCTCTG GAAAAGGCCA GCGTTGTCTC CAAACTTTTT TTCAGCTGGA 660 

25 . 

CCAGACCAAT TTTGAGGAAA GGATACAGAC AGCGCCTGGA ATTGTCAGAC ATATACCAAA 720 

TCCCTTCTGT TGATTCTGCT GACAATCTAT CTGAAAAATT GGAAAGAGTU^ TGGGATAGAG 780 

30 AGCTGGCTTC AAAGT^AAAAT CCTAAACTCA TTAATGCCCT TCGGCGATGT TTTTTCTGGA 840 

GATTTATGTT CTATGGAATC TTTTTATATT TAGGGGAAGT CACCAAAGCA GTACAGCCTC 900 

TCTTACTGGG AAGAATCATA GCTTCCTATG ACCCGGATAA CAAGGAGGAA CGCTCTATCG 960 

35 

CGATTTATCT AGGCATAGGC TTATGCCTTC TCTTTATTGT GAGGACACTG CTCCTACACC 1020 

CAGCCATTTT TGGCCTTCAT CACATTGGAA TGCAGATGAG AATAGCTATG TTTAGTTTGA 1080 

40 TTTATAAGAA GACTTTAAAG CTGTCAAGCC GTGTTCTAGA TAAAATAAGT ATTGGACAAC 1140 

TTGTTAGTCT CCTTTCCAAC AACCTGAACA AATTTGATGA AGGACTTGCA TTGGCACATT 1200 

TCGTGTGGAT CGCTCCTTTG CAAGTGGCAC TCCTCATGGG GCTAATCTGG GAGTTGTTAC 1260 

45 

AGGCGTCTGC CTTCTGTGGA CTTGGTTTCC TGATAGTCCT TGCCCTTTTT CAGGCTGGGC 1320 

TAGGGAGAAT GATGATGAAG TACAGAGATC AGAGAGCTGG GAAGATCAGT GAAAGACTTG 1380 

50 TGATTACCTC AGAAATGATT GAAAACATCC AATCTGTTAA GGCATACTGC TGGGAAGAAG 1440 

CAATGGAAAA AATGATTGAA AACTTAAGAC AAACAGAACT GAAACTGACT CGGAAGGCAG 1500 

CCTATGTGAG ATACTTCAAT AGCTCAGCCT TCTTCTTCTC AGGGTTCTTT GTGGTGTTTT 1560 

55 

TATCTGTGCT TCCCTATGCA CTAATCAAAG GAATCATCCT CCGGAAAATA TTCACCACCA 1620 

TCTCATTCTG CATTGTTCTG CGCATGGCGG TCACTCGGCA ATTTCCCTGG GCTGTACAAA 1680 
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20 
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CATGGTATGA CTCTCTTGGA GCAATAAACA AAATACAGGA TTTCTTACAA AAGCAAGAAT 1740 

ATAAGACATT GGAATATAAC TTAACGACTA CAGAAGTAGT QATQGAGAAT GTAACAGCCT 1800 

5 TCTGGGAGGA GGGATTTGGG GAATTATTTG AGAAAGCAAA ACAAAACAAT AACAATAGAA 1860 

AAACTTCTAA TGGTGATGAC AGCCTCTTCT TCAGTAATTT CTCACTTCTT GGTACTCCTG 1920 

TCCTGAAAGA TATTAATTTC AAGATAGAAA GAGGACAGTT GTTGGCGGTT GCTGGATCCA 1980 

CTGGAGCAGG CAAGACTTCA CTTCTAATGA TGATTATGGG AGAACTGGAG CCTTCAGAGG 2040 

GTAAAATTAA GCACAGTGGA AGAATTTCAT TCTGTTCTCA GTTTTCCTGG ATTATGCCTG 2100 

GCACCATTAA AGAAAATATC ATCTTTGGTG TTTCCTATGA TGAATATAGA TACAGAAGCG 2160 

TCATCAAAGC ATGCCAACTA GAAGAGGACA TCTCCAAGTT TGCAGAGAAA GACAATATAG 2220 

TTCTTGGAGA AGGTGGAATC ACACTGAGTG GAGGTCAACG AGCAAGAATT TCTTTAGCAA 2280 

GAGCAGTATA CAAAGATGCT GATTTGTATT TATTAGACTC TCCTTTTGGA TACCTAGATG 2340 

TTTTAACAGA AAAAGAAATA TTTGAAAGCT GTGTCTGTAA ACTGATGGCT AACAAAACTA 2400 

25 GGATTTTGGT CACTTCTAAA ATGGAACATT TAAAGAAAGC TGACAAAATA TTAATTTTGC 2460 

ATGAAGGTAG CAGCTATTTT TATGGGACAT TTTCAGAACT CCAAAATCTA CAGCCAGACT 2520 

TTAGCTCAAA ACTCATGGGA TGTGATTCTT TCGACCAATT TAGTGCAGAA AGAAGAAATT 2580 

CAATCCTAAC TGAGACCTTA CACCGTTTCT CATTAGAAGG AGATGCTCCT GTCTCCTGGA 2640 

CAGAAACAAA AAAACAATCT TTTAAACAGA CTGGAGAGTT TGGGGAAAAA AGGAAGAATT 2700 

CTATTCTCAA TCCAATCAAC TCTATACGAA AATTTTCCAT TGTGCAAAAG ACTCCCTTAC 2760 

AAATGAATGG CATCGAAGAG GATTCTGATG AGCCTTTAGA GAGAAGGCTG TCCTTAGTAC 2820 

CAGATTCTGA GCAGGGAGAG GCGATACTGC CTCGCATCAG CGTGATCAGC ACTGGCCCCA 2880 

CGCTTCAGGC ACGAAGGAGG CAGTCTGTCC TGAACCTGAT GACACACTCA GTTAACCAAG 2940 

GTCAGAACAT TCACCGAAAG ACAACAGCAT CCACACGAAA AGTGTCACTG GCCCCTCAGG 3000 

CAAACTTGAC TGAACTGGAT ATATATTCAA GAAGGTTATC TCAAGAAACT GGCTTGGA/UV 3060 

TAAGTGAAGA AATTAACGAA GAAGACTTAA AGGAGTGCCT TTTTGATGAT ATGGAGAGCA 3120 

TACCAGCAGT GACTACATGG AACACATACC TTCGATATAT TACTGTCCAC AAGAGCTTAA 3180 

•TTTTTGTGCT AATTTGGTGC TTAGTAATTT TTCTGGCAGA GGTGGCTGCT TCTTTGGTTG 3240 

TGCTGTGGCT CCTTGGAAAC ACTCCTCTTC AAGACAAAGG GAATAGTACT CATAGTAGAA 3300 

55 ATAACAGCTA TGCAGTGATT ATCACCAGCA CCAGTTCGTA TTATGTGTTT TACATTTACG 3360 

TGGGAGTAGC CGACACTTTG CTTGCTATGG GATTCTTCAG AGGTCTACCA CTGGTGCATA 3420 

CTCTAATCAC AGTGTCGAAA ATTTTACACC ACAAAATGTT ACATTCTGTT CTTCAAGCAC 3480 
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CTATGTCAAC CCTCAACACG TTGAAAGCAG GTGGGATTCT TAATAGATTC TCCA/^GATA 3540 

TAGCAATTTT GGATGACCTT CTGCCTCTTA CCATATTTGA CTTCATCCAG TTGTTATTAA 3600 

5 

TTGTGATTGG AGCTATAGCA GTTGTCGCAG TTTTACAACC CTACATCTTT GTTGCAACAG 3660 

TGCCAGTGAT AGTGGCTTTT ATTATGTTGA GAGCATATTT CCTCCAT^CC TCACAGCAAC 3720 

10 TCAAACAACT GGAATCTGAA GGCAGGAGTC CAATTTTCAC TCATCTTGTT ACAAGCTTAA 3780 

AAGGACTATG GACACTTCGT GCCTTCGGAC GGCAGCCTTA CTTT6AAACT CTGTTCCACA 3840 

AAGCTCTGAA TTTACATACT GCCAACTGGT TCTTGTACCT GTCAACACTG CGCTGGTTCC 3900 

15 

AAATGAGAAT AGAAATGATT TTTGTCATCT TCTTCATTGC TGTTACCTTC ATTTCCATTT 3960 

TAACAACAGG AGAAGGAGAA GGAAGAGTTG GTATTATCCT GACTTTAGCC ATGAATATCA 4020 

20 TGAGTACATT GCAGTGGGCT GTAAACTCCA GCATAGATGT GGATAGCTTG ATGCGATCTG 4080 

TGAGCCGAGT CTTTAAGTTC ATTGACATGC CAACAGAAGG TAAACCTACC AAGTCAACCA 4140 

AACCATACAA GAATGGCCAA CTCTCGAAAG TTATGATTAT TGAGT^TTCA CACGTGAAGA 4200 

25 

AAGATGACAT CTGGCCCTCA GGGGGCCAAA TGACTGTCAA AGATCTCACA GCAAAATACA 4260 

CAGAAGGTGG AAATGCCATA TTAGAGAACA TTTCCTTCTC AATAAGTCCT GGCCAGAGGG 4320 

30 TGGGCCTCTT GGGAAGAACT GGATCAGGGA AGAGTACTTT GTTATCAGCT TTTTTGAGAC 4380 

TACTGAACAC TGAAGGAGAA ATCCAGATCG ATGGTGTGTC TTGGGATTCA ATAACTTTGC 4440 

AACAGTGGAG GAAAGCCTTT GGAGTGATAC CACAGAAAGT ATTTATTTTT TCTGGAACAT 4500 

35 

TTAGAAAAAA CTTGGATCCC TATGAACAGT GGAGTGATCA AGAAATATGG AAAGTTGCAG 4560 

ATGAGGTTGG GCTCAGATCT GTGATAGAAC AGTTTCCTGG GAAGCTTGAC TTTGTCCTTG 4620 

40 TGGATGGGGG CTGTGTCCTA AGCCATGGCC ACAAGCAGTT GATGTGCTTG GCTAGATCTG 4680 

TTCTCAGTAA GGCGAAGATC TTGCTGCTTG ATGAACCCAG TGCTCATTTG GATCCAGTAA 4740 

CATACCAAAT AATTAGAAGA ACTCTAAAAC AAGCATTTGC TGATTGCACA GTAATTCTCT 4800 

45 

GTGAACACAG GATAGAAGCA ATGCTGGAAT GCCAACAATT TTTGGTCATA GAAGAGAACA 4860 

AAGTGCGGCA GTACGATTCC ATCCAGi\AAC TGCTGAACGA GAGGAGCCTC TTCCGGCAAG 4920 

50 CCATCAGCCC CTCCGACAGG GTGAAGCTCT TTCCCCACCG GAACTCAAGC AAGTGCAAGT 4980 

CTAAGCCCCA GATTGCTGCT CTGAAAGAGG AGACAGAAGA AGAGGTGCAA GATACAAGGC 5040 

TTTAGAGAGC AGCATAAATG TTGACATGGG ACATTTGCTC ATGGAATTGG AGGTAGCGGA 5100 

TTGAGGTACT GAAATGTGTG GGCGTGGCTT AAGGGTGGGA AAGAATATAT AAGGTGGGGG 5160 

TCTCATGTAG TTTTGTATCT GTTTTGCAGC AGCCGCCGCC ATGAGCGCCA ACTCGTTTGA 5220 



55 



-113- 

TGGAAGCATT GTGAGCTCAT ATTTGACAAC GCGCATGCCC CCATGGGCCG GGGTGCGTCA 5280 

GAATGTGATG GGCTCCAGCA TTGATGGTCG CCCCGTCCTG CCCGCAAACT CTACTACCTT 5340 

i GACCTACGAG ACCGTGTCTG GAACGCCGTT GGAGACTGCA GCCTCCGCCG CCGCTTCAGC 5400 

CGCTGCAGCC ACCGCCCGCG GGATTGTGAC TGACTTTGCT TTCCTGAGCC CGCTTGCAAG 5460 

CAGTGCAGCT TCCCGTTCAT CCGCCC6CGA TGACAAGTTG ACGGCTCTTT TGGCACAATT 5520 

GGATTCTTTG ACCCGGGAAC TTAATGTCGT TTCTCAGCAG CTGTTGGATC TGCGCCAGCA 5580 

GGTTTCTGCC CTGAAGGCTT CCTCCCCTCC CAATGCGGTT TAAAACATAA ATAAA 5635 
15 (2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
ACTCTTGAGT GCCAGCGAGT AGAGTTTTCT CCTCCG 36 
(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
35 (B) TYPE: nucleic acid 

. (C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

45 GCAAAGGA6C GATCCACACG AAATGTGCC 29 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
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CTCCTCCGAG CCGCTCCGAG CTAG 

(2) INFORMATION FOR SEQ ID NO: 7: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
CCAAAAATGG CTGGGTGTAG GAGCAGTGTC C 
20 (2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 
CGGATCCTTT ATTATAGGGG AAGTCCACGC CTAC 
(2) INFORMATION FOR SEQ ID NO: 9: 
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35 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear. 



•45 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 
50 CGGGATCCAT CGATGAAATA TGACTACGTC CG 
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Claims 

1 . An adenovirus-based gene therapy vector comprising the genome of an adenovirus 2 
serotype in which the Ela and Elb regions of the genome, which are involved in early stages 

5 of viral replication, have been deleted and replaced by genetic material of interest. 

2. The adenovirus-based gene therapy vector of claim 1 , wherein the genetic material of 
interest is DNA encoding cystic fibrosis transmembrane conductance regulator 

10 3 . The adenovirus-based gene therapy vector of claim 1 further comprising PGK 
promoter operably linked to the genetic material of interest. 

4. The adenovirus-based gene therapy vector of claim 2 having substantially the same 
nucleotide sequence as shown in Table II (SEQ ID N0:3). 

15 

5 . An adenovirus-based gene therapy vector comprising adenovirus inverted terminal 
repeat nucleotide sequences and the minimal nucleotide sequences necessary for efficient 
replication and packaging and genetic material of interest. 

20 6. The adenovirus-based gene therapy vector of claim 5 having the adenovirus 2 
sequences shown in Figure 17. 

7. The adenovirus-based gene therapy vector of claim 5 further comprising PGK 
promoter operably linked to the genetic material of interest. 

25 

8. The adenovirus-based gene therapy vector of claim 5 in which the genetic material of ' 
interest is selected from the group consisting of DNA encoding: cystic fibrosis 
transmembrane conductance regulator, Factor VIII, and Factor IX. 

• 30 9. An adenovirus-based gene therapy vector comprising an adenovirus genome which 
has been deleted for all E4 open reading frames, except open reading frame 6, and 
additionally comprising genetic material of interest. 

1 0. The adenovirus-based gene therapy vector of claim 9 further comprising PGK 
35 promoter operably linked to the genetic material of interest. 

1 1 . The adenovirus-based gene therapy vector of claim 9 in which the Ela and Elb regions 
of the genome, which are involved in early stages of viral replication, have been deleted. 
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12. The adenovinis-based gene therapy vector of claim 9 in which the E3 region has been 
deleted. 

13. An adenovirus-based gene therapy vector comprising an adenovirus genome which 
5 has been deleted for all E4 open reading frames, except open readmg frame 3, and 

additionally comprising genetic material of interest. 

1 4. The adenovinis-based gene therapy vector of claim 1 3 in which the Ela and Elb 
regions of the genome, which are involved in early stages of viral replication, have been 

10 deleted. 

15. The adenovirus-based gene therapy vector of claun 13 fiirther comprising PGK 
promoter operably linked to the genetic material of interest. 

15 16. The adenovirus-based gene therapy vector of claim 1 3 in which the E3 region has 
been deleted. 

17. A method for treating or preventing cystic fibrosis in a patient comprising 
administering to the puhnonaiy airways of the patient, a gene therapy vector comprising 

20 DNA fflicoding cystic fibrosis transmembrane conductance regulator. 

1 8. The method of claim 1 7 wherein the gene therapy vector is an adenovirus-based gene 
therapy vector comprising the genome of an adenovirus 2 serotype in which the Ela and Elb 
regions of the genome, which are involved in early stages of viral replication, have been 

25 deleted and replaced by DNA encoding cystic fibrosis transmembrane conductance 
regulator. 

1 9. The method of claim 1 7 wherein the gene therapy vector fiirther comprises PGK 
promoter operably linked to the DNA encoding cystic fibrosis transmembrane conductance 

' 30 regulator. 

20. The method of claim 1 7 \s4ierein the gene therapy vector is an adenovirus-based gene 
therapy vector comprising adenovirus inverted terminal repeats and the minimal sequences 
necessary for efficient replication and packaging and DNA encoding cystic fibrosis 

35 tranmembrane conductance regulator. 

2 1 . The method of claim 20 wherein the gene therapy vector fiirther comprises PGK 
promoter operably linked to the DNA encoding cystic fibrosis transmembrane conductance 
regulator. 
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22. The method of claim 17 wherein the gene therapy vector is an adenovirus-based gene 
therapy vector comprising an adenovirus genome which has been deleted for all E4 open 
reading frames, except open reading frame 6, and additionally comprising DNA encoding 

S cystic fibrosis transmembrane conductance regulator. 

23. The method of claim 22 wherein the gene therapy vector further comprises PGK 
promoter operably linked to the DNA encoding cystic fibrosis transmembrane conductance 
regulator. 

10 

24. The method of claim 17 wherein the gene therapy vector is an adenovirus-based gene 
therapy vector comprising an adenovirus genome which has been deleted for all E4 open 
reading frames, except open reading fi^me 6, and has been deleted for the Ela and Elb regions 
of the genome, which are involved in early stages of viral replication, and additionally 

1 5 comprising DNA encoding cystic fibrosis tranmembrane conductance regulator. 

25. The method of claim 24 wherein the gene therapy vector further comprises PGK 
promoter operably linked to the DNA encoding cystic fibrosis transmembrane conductance 
regulator. 
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STRATEGY FOR CONSTRUCTING pKK- CFTRl 
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CONSTRUCTION OF THE pKK- CFTR2 PLASMID 
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STRATCGY FOR CONSTRUCTING THE pSC- CFTTU PLASMTD 
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Figure 4 
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MAP OF pSC-CFTR2 
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Figure 5 
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CONSTRUCTION OF THE pKK- CFTR3 cDNA 
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CONSTRUCTION OF THE pKK- CFTRS GLOME (cont'd.) 
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MAP OF pKK- CFTRS 
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Figure 12B 
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Figure 15 



17/50 



COimOL RAT INFECTED lATS 




1 2 3 4 56789 10 

Figure 16 



18/50 




872 
§03 
506 



Figure 17 



19/50 



% I 

100 
80 
60- 
40- 
20- 
0- 



% 
100- 

80' 

60- 

40- 

20- 

0 



1.1 





.1 



1.0 



0.9 




■Mm 







Figure 18A 



10 



14 



Controls 



Days After Infection 



1.7 



UmtUlfffl, 





1.9 



X N N \ > \ -x * 




"'I'l'IIIUIIII 





1.4 



iiiminiiiini 




Figure 18B 



CD Other 

□ Lymphocytes 
E3 Macrophages 
S Neutrophils 



L 



14 



Controls 



Days After Infection 




Figure 19 



21/50 



CO 





22/50 




9 



23/50 




Figure 22 



24/50 



ANTIBODY TITERS 



8000 
Figure 23A 



4000- 



First infection 
A • 



Second infection 



0-? 
20000 




Figure 23B 



Figure 23C 



Figtire 23D 

000 



Third Infection 




50 



100 



B A 








^ ir-\-. 


.1 




D 








1 ^ ^ 1 






1 

T 



150 
days 



25/50 




Figure 24 




Figure 25 




Figure 26 



28/50 




Figure 27 



30/50 




2 min 



31/50 



Vt 

(mV) 



0 

0- 
■20- 
•30*" 
.40- 
50 



.09 -7 0 




Figure 30A 



(mV) 



6 

4- 

2- 

0 
-2- 
-4- 
-6 



r 

I 
I 
I 
I 

I 
I 



Figure SOB 







i • • 



-89 -7 0 



10 
Day 



20 



T — -r- 

30 



40 



32/50 



Vt 
(mV) 



•10- 



-20- 



.30' 



-40^ 



.50 



256-189 -7 0 



I. 



2- 



AVt 
(mV) 



S- 



2- 



•4- 



( 
I 

I 



Figure 30C 



10 



— f 

20 



— : — 
30 



1 0' 

1 
1 



-255-189 -7 0 



— I — 
10 

Day 



— r- 

20 



— r~ 

30 



40 



Figxare 30D 



40 



33/50 



Vt 

(mV) 



20- 

-30- 

-40- 

-50- 

6- 
4* 

2- 

(mV) 0" 

-8- 




Figure 30E 



.117-25- 0 



4Q 



1 
! 



1 

1 • 



Figure 30F 



-117-25 0 



■ * 
10 

Day 



20 



30 



40 



34/50 




Figure 31 



35/50 



o 3^^ 



fMirrifi L^tz ifi n 



^ m II I I y I rf n 

^fi\}X Chuina CflCt-etTL ^ 

Figure 32 





Figvire 33 



»AdC40Qr6 



l^ — 



37/50 




38/50 




Figure 35 



39/50 




SUBSTITUTE SHEET (RULE 26) 



40/50 




SUBSTITUTE SHEET (RULE 26) 



41/50 




SUBSTITUTE SHEET (RULE 26) 



42/50 

CUNfCAL SIGNS MONKEY C 



AGE 7 YEARS 



DATE 



pyAMINATinM HeARTRATE RAT£ TEMPgtATURE WBGHT 



S/11/93 
5/11/93 
5/14/93 
5/18/93 

6/4/93 
6/18/93 
6/24/93 
6/24/93 
1 6/28/93 

7/5/93 
7/1!2/93 
9/1 7/93 



(beats/min) (breath/min) 



i>NjriiviML. 


1 12 


16 








NORMAL 


98 




NORMAL 


104 


16 


NORMAL' 


108 


16 


NORMAL 


112 


16 


NORMAL 


116 


18 




INFECTION 




NORMAL. 


104 


18 


granulation 


116 


16 


NORMAL 


114 


20 


NORMAL 


108 


16 



(Celsius) 
37.8 

38.1 
38.3 
38.2 
38.4 
38.8 

37.9 
37.4 

38.3 
38-3 



(Kg) 
6.4 



Figure 39A 



CUNfCAL SIGNS MONKEY D 



AGE 7 YEARS 



DATE 


EXAMINAIION 


HEARTRATE 


RESPRATE 


TEK«=eRA1XJR£ 


WEKSKT 






(beats/min) 


(breath/min) 


(Celsius) 


(Kg) 


5/11/93 


NORMAL 


108 


18 


38.3 


6.25 


5/11/93 




INFECTION 








S/14/93 


NORMAL 


100 


20 


38.4 




5/18/98 


NORMAL 


98 


20 


38.4 




6/4/93 


NORMAL 


106 


18 


37.9 




6/18/93 


NORMAL 


100 


19 


38.4 




6/24/93 


NORMAL 


106 


16 


37.8 




6/24/93 




INFECTION 




37-4 




1 6/^8/98 


NORMAL 


104 


16 




7/5/93 


NORMAL 


102 


14 


38.8 




7/12/93 


granulation 


114 


16 


38 




9/1 7/93 


NORMAL 


104 


16 


38.3 


6.4 



Figure 39B 



CLINICAL SIGNS MONKEY E 



AGE 11 YEARS 
WEKSKT 



DATE 



EXAMINATION HEARTRATE R^PRATE TEMPERATURE 



5/11/93 
5/1 1/93 
5/14/93 
S/18/93 

6/4/93 
6/18/93 
6/24/93 
6/24/93 
16/28/93 

7/5/93 
7/12/93 
9/17/93 



(beats/min) (breath/min) 



NORMAL 


120 


18 




INFECTION 




NORMAL 


112 


20 


NORMAL 


108 


22 


NORMAL 


112 


20 


NORMAL 


106 


20 


NORMAL 


108 


18 




INFECTION 




NORMAL 


112 


20 


NOf=lMAL 


106 


22 


NORMAL 


114 


1 6 


NOFIMAL 


1 14 


1 6 



(Celsius) 
28.3 

37.9 
38.4 
38.3 
38.3 
38.9 

38 
38.3 

38 
38.3 



(Kg) 
10 



8.75 



43/50 



Monkey C 



o 

CO 



9 

•—V 



C3 



I 

O 



o 

fa 

CO 

SI 

03 



3 



1- 



O O O O Ci CO CM ^ 

^ o> CO ?: ^ 

CO CO 



^ CI CM ^ 2 CO 



U5 ^ 



CO ^ o 
CO ^ CO 

CM 



XT a> c> <o ^ ^ 



^ « :i 



O 'T- ^ 



in ^ 

«^ ^* 1^: 



r- <0 CM K/> GO 

o CO 33 o «2 



cooooo ir> o 

4>*'S2l::S2co«^^ 



^ CO in ^ 
CO CM 



o> o o o o 

lO 'er CJ CO 

• in CO ^ 

CO CO 



CM 



in CO o 
^ o 

CO 



^ CD ^ CO 
O C4 r-^ 



^ CD CM CJ ^ ^ 



to 



CO 



^ o« ^ ^ 



t4 Oi 

o ^ <o 



o 

<o r^' 



o in <p m CD 1- 
o «^ ^ o 



CJ o rN. cj m 
o ci i: o «^ c> 



r~ O O O O 00 ^ CO ^ 
kI GO CO <n CM ^ ^ CO 

^ r-. CO "T- £f CO 



Oi O O Q O <0 CM CO 



00 r< o o> 



CM 



o <o J; 

CO XT 



CO 



o>oooo CM CO a> 

O CM CM ^ CO 

O CM lA CO 
CO 



<p o 

^ Q €Si 'T- 



CO <o <o O CO 
^ ^ O CM 



CM 1^^ 
^ OD ^ 



<D CO 
tn ^ ^ 



o> f*^. m CM m. o ^ 
^ o 
V 



o in 

CO 



CO a> <o CM <7> ^ 

^ in o ^ ^ d 
CO tn ^ 



fa 



crJ CO H 



^ZfaWOH-^OZ 



O O' o o 

^ in <o CM CO 



CM 



<x> ^ ^ :i 



CO ^ ^ 
CO ^ V 
CO 



o> <o o CO t-^eoocor-cow-;-^ 

^c;-^'-^«>^r--co;:ocoOo 




44/50 



Monkey D 



o 
I 



o 



E 

I 

CO 

-Si 



00 

e 



c 

oo 



t 



CO .o 

00 



o o o lo oo 



CO ^ 
CM CO CM J[ 
CO CO 



CO 



00 



CM 



flO CO <0 CM 

nj- O ^ 



00 to 0> 0> CO 

* c> o - 



CO <o U3 fN. 
" ^ o V 



CM 

o «^ ^ 

CM 



<Din^ cocoflo«-mcMv-^ 

^ CO CM V 



CO td CJ O 2 P 



^^4^ 

1^ ^ ^ 52 



CO CM O "H- 
CM 00 CO CM 



O CO to 

« ^ ^ 



^ OO o CO 
^ ^ ^ 

T- T- 



o> m o> o ^ 



O CD <0 
^ CO 



u> ^• <o 

«o o 

CO CO 



CO 



^ a ^ ^ 



CMCslC0«DC0CMCMIO0O 
€»>'0^C0<J>'CM«O*'>P 



t^OOOO CO ^ O) 
^ O lO ^ 52 CO 



A O O O O 

^ CO o ^ ^ 

O ^ CO CM 

CO CD 



r*^ o CO ^ 



to 



CO o o o 



CO 



CM CM ^ 



CMCMCO ^ (D<OCVtCOCM«— 

: in o CO ;;: c> 2 s; <5 



CMOOCAO CM rN.<g 



• o GO to 



O to CD O oo 



CO 



^ ^ 



CO CO 



o CM 



^ lO ^.CO CM CO ^ 

«^r^oco'5c>coOo 



fa ^ Pi w H 



rv.oooo O) to oo^ 
CO CO CO lo CO <^ 

00 CO ^ ^ CM 
CM CO 



^ 02 cn 

^8 § 



fs. ao 0> O O 

^ « ^ ^ - 



^ lOCOCD CO CM.CD'-OO'T- 



^ ^ CO O CO 



CM 



CM CM 

to 



< 



o 



8 5 S 



5 - ^ ^ 
5- ob ^ 



s 



: Q Q H H J ^ 
J S u] S £ a. cu 



= y 

CO Q 

< t:^ :d 



C3 
o 



[X4 



45/50 



Monkey E 



CM 



I? 
O 



oo 



X3 
9 



00 



C3 



I 

I 

oo 



CM Ift CJ -r- 

- ^ <0 CD oo 

in <SJ -r- 

CM m 



CO 



CO 



CO 
CO 



CO Csi ^ 
^ CO V 



2 V 

CM 



O CO o o 

^ CO «^ 



00 o> ^ 



CM o ift ^ 

^ <D ^ 0> 



to CM ^ O 
«0 CM O o 
CO CM V 



^ CO ^ 



o eu 



^f^^cor^fOtocMcor^^ 

CM V 



<o o o o o 

^ ^ ^ <o 

o <o ^ ^ 

CM to 



CO o o o 

• CO <o u> 

^ ^ CO CO 

^ CO 



CO o o o o 

^ ^ eo to 

CM to CM 

CO ^ 



CM 



CO O 
CO 



<D ^ 

CM 



CM 



0> <D ^ CO CO 

; « «^ - 



<o o CM to 
^ CO r ^ ^ 



m oo CO CO <p CM CO T— ^ 

<0 5Xt^C>C0^C>CM«^ 



<£> m CO T~ co<r-r>-rs.^ 



CO 



CO 



CM — 

CD 
CM 



Is^COf^O^-CMCM^-r- 
^ O CM - ^ O 



CM o CO 



^ o o o o to 

<0 CM CM ^ 

^ *0 CM U> ^ 

CM ^ 



CO 



1^ y 
CM 



^ CO O lO CD 



«o 



« CO i: 



CM CO CM 



^ CM o> 

CO Q C4 i£> cS 
^ CO V 



Est* fitf CO 



o o <z> o 

m CO CM CO 

flO O ^ . 

^ CO 



CM 



O -i- 
CM 



CO CO o <o CO to ^r^cotococMCMaj-r- 



g B f W 

H 3: y 

CO A S 

< 13 ;3 




o 
o 



U4 



wo 94/1Z649 



rci/uayj/iioo/ 



46/50 



€0 
C5> 



CO 



CO 



i 



CO 

eg 

CD 



CO 



CP 
CO 

o 

5! 



eo 



CO 



CO 



CO 
<7> 



CO 



Ui 

I— 
< 

Q 



cn o 

CO CO 



tn O o. €0 > 



OS ui O O z O 



^ to 

CM 



O O 



^ CM 

r** csi 



CO 

CO CO 



CO 



S 5 ^ CM O 



u. ^ a: CO H- 



GO O 

*x -^x ^ *~ 
CO CO 



S • ^ :^ S :^ 
f— -<= — !c j5 

QL "Si Q_ — 

to- « O ^ <o 

C/D C3 C O 

uj cr 2 >p uj 



lO 



to 



o u> ^ o 
CM " 



CQ ^ O a. CO >- 



CO Ul O O z o 



5 CM o o 



eg lo _ ^ 

N CM 



2 2J CM O 
CD CO 



U« OC H 



2 2^00 



P ^ S Z >»-c= 

fr ^ 8 S E o 



UI 



to 



to 



to 



CO to 

CM 



CM 



0 ^ O 6U CO >- 



CO Ui O O Z Q 



CM O O 



^ ^ — 
^ ^ ^ 



«^ S O CM O 



o O _ 
CO CO ^ 



— fie CO h- 



CO 



CO ^ 



iipri 

t ^ <D o E o 



wo 94/12649 



47/50 




Figure 42 



wo 94/12649 



48/50 



n;i7usy3/ii667 




Fj^ure 43 



wo 94/12649 



PCT/US93/11667 




wo 94/12649 



tVl7U5«/ll0to/ 



50/50 



NEUTRALIZING ANTIBODIES o 



CQ 

00 



U 

60 




saaiii AQoaiiNv 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
Intecnatioiial Bufeau 



PCX 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent CbssiBcation S - 
C12N 15/86, 15/12, A61K 48/DO 



A3 



(11) International PbMicatbn Number: 
(43) International Publication Date: 



WO 94/12649 

9 June 1994 (09.06.94) 



(21) international AppBcatbn Numben FCr/US93/l 1667 

(22) International Filing Date: 2 December 1993 (02.12.93) 



(30) Priority Data: 
07/985.478 
08/130.682 
08/136,742 



3 December 1992 (03.12.92) US 
1 October 1993 (01.10.93) US 
13 October 1993 (13.10^3) US 



(71) Applicant: GENZYME CORPORATION [US/US]; One 

KendaU Square, Cambridge. MA 02139 (US). 

(72) Inventors: GREGORY, Rkhaid, J.; 4789 (Jateshead Road, 

Carlsbad, CA 92008 (US). ARMENTANO, Donna; 33 
Clarver Road. Watertown. MA 02172 (US). COUTURE, 
Larry. A; 67 Circle Drive, Framingfaam, MA 01701 (US). 
SMTTH. A]an,E.; 88Cleveland Road, Wellesley, MA 02181 
(US). 

(74) Agents: HANUBY, Elizabedx, A. et aL; Lahive & C^ockfield, 
60 State Street, Boston, MA 02109 (US). 



(81) Designated States: AU, CA, JP, European patent (AT. BE, 
CH, DE, DK, ES, FR, GB, GR, IE, IT, LU, MC, NL, FT, 
SE). 



Published 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendmerus, 

(88) Date of publication of the International search report: 

10 November 1994 (10.11.94) 



(54) Title: GENE THERAPY FOR CYSTIC FIBROSIS 
(57) Abstract 



Gene Then^y vectors, 
which are eq)ecia]ly useful for 
cystic fibrosis, and methods for 
using the vectors are disclosed. 
In preferred embodiments, tlie 
vectors are adenovirus-based. 
Advantages of adenovirus-based 
vectors for gene therapy are that 
they appear to be relatively safe 
and can be manipulated to encode 
the desired gene product and 
at the same time are inactivated 
in terms of their ability to 
replicate in a nmnal lytic 
viral life cycle. Additionally, 
adenovirus has a natural 
tropism for airway epithelia. 
Therefore, adenovirus-based 
vectors are particulariy prefeired 
for respiratory gene therapy 
applications such as gene 
therapy for cystic fito)sis. 
In one embodiment, the 
adenovirus-based gene therapy 
vector comprises an adenovirus 
2 serotype genome in which 
the Ela and Elb regions of the 
genome, which are involved in 
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early stages of viral replication have been deleted and replaced by genetic material of interest (e.g., DNA encoding die cystic fibroas 
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contain no potentially hamifid viral genes, have a tiieoretical cqracity for foreign material of neariy 36 kb, may be produced in reasonably 
high titers and fp«mtain the tropism of the parent adenovirus for dividing and noo-<lividing human taxget cell types. 
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X No required additional search fees were Umely paid by \he applicant. Consequently, this inusrnaUonal search report is 
— resu-icied to the invenUon first mentioned in Uie claims; it is covered by claims Nos.: 
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Remark on Protest Q The addiUonai search fees were accompanied by Uie appUcant's protest. 

I I No protest accompanied the payment of addiUonal search fees. 
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1. Claims 1-5,7,8,18 (completely); 11 , 14 , 24 ,.25- (partially) : 
Adenovirus-2 based vectors deleted for Ela and Elb genes 

2. Claims 9,10,12,13,15,16 (completely); 11,14,22-25 (partially): 

Adenoviral vectors deleted for all E4 open reading frames 
except 6 or 3 

3. Claims 1T,19t21 ( completely) ; 22 , 23 (partially): 

Gene therapy for Cystic Fibrosis by administering 
to the pulmonary airways of a patient a vector 
encoding CFTR gene 
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